FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGT CT GAAGACTAACAT T T TGTGAAGT TGTAAAACAGAAAACC T GT TAGAAATGT GGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
C TACCAT T T ATGT TCGTTAT AAGCAAGT T CATGC TC T GAGT CC TGAAGAGAACG T TATCAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTT TGCACAGTGGCAATT TTGGGACTGAT TTAGTVACAGAAACTCCAT TGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTT TTTTCCTGACT TACATTCGTGATTTTCAGAAAAT TT CTTTACGGGTGGAAGCCAATT TACA 
TGGAT TAACCC T C TAT GACAC TGCACCT T GCCC TAT TAACAATGAAC GAACACGGC TACT T T CCA 
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
^ GATGAATGCTGA.TAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT 
m CAAGAA.GACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 

rii atg 



m 

:• r. 

'A 

U1 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMS FSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains : 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites ♦ 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCC ATGA GTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
C| CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
Gil TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
j" 1 ; C TC AGGAATGT GACT GACAC G GAT AT T C T AG CC C T G GAAC GGC GACT GC T G CAAAC CAT G GAT AT GAT 

'?£ C^TAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
[fl AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
\| CTTAT TCAACAGGAAGTGGAT GCT TTGGAAGAATTAAGCAGGCAGCTTT TT CT GGAAACAGCT GATCT 

3 ATATGCTACCAAGGAGAGAATAGAATACT CCAAAAC CTT CAAGGGGAAATATT TTAATT T TCTTGGTT 

W ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

IB 

GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

I'M 

•m TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
py CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAAT AC C GCAC CAT AAT CAC TGAAGT C C T T GGAGAACT G C AG T T CAAC T TC T AT C ACC GT T GGT T T G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGC AAAT G GCACC T TGAA C T T AAGC C T AC TACAGACT GT T AGAGG CCAG T GGT TT C AAAAT T TA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACATT TTATAAACAAACAAAAT GCT ATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATAC T CAGCAGAGAGCAT CCCGTGTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
C TAT GGT AGCT GAGC CAAAC ACG T AGGAT TTCCGTTT TAAGGT T C ACAT GGAAAAGG TTAT AGC TT T G 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDIIJ^ERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
T VNYLG I QFDVKFWS QH I S FI LVG 1 1 1 VT S IRGLL I TLTKFFYAI S S SKS SNVI VLLLAQ IMGMY 
FVS SVLLIRMSMPLEYRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALSS I LFLYLAHKQAPEKQMAP 

Important features : 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TAT C AC C AC C AAAGAT C C CAT AT T C AACAC T C AAAC T G C AAC AC AAAC AAC AG AAT T TAT T G T 

. u . CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTT CTAT TCCACGGAGAAAAAAATT GATT TGTGTCACAGAAGTTTT TATG 

W GAAAC T AGCAC CAT G T C T AC AGAAAC T GAAC CAT T T G T T GAAAATAAAG CAG CAT T CAAGAAT GA 

rat, 

l,J AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
Pfj CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
} j I CAGCAGAAGGA^TGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
T{i TGAGGAAT CAAAGAAAACT GATAAAAACCCAGAAGAG T CCAAGAGTCCAAGCAAAAC TACCGT GC 

^ GATGCCTGGAAGCTGAAGT TTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTT TCTTT 

Ml CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
%j CCCTTGGTTCC TAAC TGGAATCAGC TCAGGAC TGCCAT T GGAC TATGGAGT GCACCAAAGAGAAT 

GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
^ T TC TAGCC T GGC TAT GTCCTAATAATATC CCAC T GGGAGAAAGGAGT T T T GCAAAGTGCAAGGAC 

CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
Ml AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
Q GCTC TGAAAGAGAAACACGTAT CCCACCT GACATG TCC T T CT GAGCCC GGTAAGAGCAAAAGAAT 

pjj GGCAGAAAAGT T TAGC CCC T GAAAGCCAT GGAGAT TC T CATAAC TTGAGACCTAATCT CTGTAAA 

GCTAAAAT AAAGAAATAGAACAAGGC T GAGGATACGACAGTACAC TGT CAGCAGGGAC T G T AAAC 
jj* ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA 
ril CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 

AAAATAAAAAC T C T T ATAAAT T T C TAT T T T TAT C T GAG T T AC AGAAAT GAT T AC T AAG GAAGAT T 
ACTCAGTAAT TTGTT TAAAAAGTAATAAAATTCAACAAACAT TTGCTGAATAGCTACTAT ATGTC 
AAGT GC TGT GCAAGGT AT TACAC T C TG T AAT TGAATAT TAT T CC TCAAAAAAT T GCACAT AGTAG 
AACGC TAT C T GGGAAGC TAT T T T T T TCAGT T T T GATAT TT CT AGC TT AT CTAC T TCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT T AACAT ACC T AAG AAG T AC AT TGT T AC C T C TAT AT AC C AAAG C AC AT T T T AAAAG T G C C 
AT TAACAAAT GTATCACTAGCCC TCCT T TT TCCAACAAGAAGGGACT GAGAGATGCAGAAAT AT T 
TGTGACAAAAAAT TAAAGCAT TTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWVGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTE VFME TS TMS TETE P FVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 



Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 



Tyrosine kinase phosphorylation site. 

amino acids 7 9-88 



i H 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
TGGACAAAT GGTC T TTT CAGAGACAGT TATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 



CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCT GCAC TGC T GGGAGGGGAAGAGAT T TAGAT C TGGACCAGGCT GT GGGTAGATGT GCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
T T TGT T C TACATGGC TTT GATAAT TGT T T GAGGGGAGGAGAT GGAAACAAT GTGGAGTC T CCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
T ACC T CT GT GCC AGGGCAGCAT T T TCAT AT CC AAGAT C AAT TCCCTCTCT C AGCAC AGC C T GGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T TT TC T TGAGGCATGCACAT CT GGAATTAAGGTCAAAC TAAT TC TCACATCCC T C TAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
T AC AG G T T AAC C T GC AGAAAC AG T AC T T AGG T AAT T G T AG G G CGAG GAT T AT AAAT GAAAT T T GC 
AAAATCAC T T AGCAGCAACT GAAGACAAT T ATCAACCACGT GGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T G T AT AAGC AT GCTTTCTTT GAG TTT T AAAT TAT G T AT AAAC AT AAG T T GC AT T T AG AA 
ATCAAGCATAAATCACT TCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 




ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGAT C C T G C T GC CCAGAGAGG T C C C C GAT GAG TAT GAAG T T GG CAGC T T CAT GGAG GAGG T G C G 
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FIGURE 8 



MQRLGATLLCLLIAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNSMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETNTDTKTC 

TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPT FVGSRDQDGE I LLPREVPDEYEVGS FMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 



APP ID=1 0063557 



Page 158 



FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
y„ CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
(JJJ TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
W ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 

?'! GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 

& 

1 ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 

Mi 

I [1 GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 

GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
^ AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
^ CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
t ii| CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 



£3 AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 



ry 



GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATT TT AT TAAAGATATT TTGT TAACTC 



APP_ID=10063557 



Page 159 of 318 



FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLSKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

j ;2 amino acids 57-80 (type II), 110-126, 215-231, 254-274 

1 1 1 N-glycosylation sites. 

yl amino acids 16-20, 27-31, 289-293 

M i 

^ Hypothetical YBR002c family proteins . 

amino acids 276-288 

m 

Q Ammonium transporters proteins. 

\V amino acids 204-231 

fU 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATCGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

Sr. :k 

:.,^ s CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
S GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
rff TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
UJ CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
Mj CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

'""4 

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
i.f| AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
Q GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
rtj ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
|;;fj ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
' 5>;:? CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGA GGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGI PT VLQGHI DCGS LLG YRAVYRMC FATAAFFFFFFTLLMLC VS S SRDPRAAI Q 
NG FW F FK FL I L VGL T VG AF Y IPDGSFTNIW FY FGWG S FL F I L I QL VL LID FAH S WNQRWL GKAE 
ECDSRAW YAGL FFFTLLFYLLS I AAVALMFMY YTE P S GCHE GKVF I S LNLTFCVCVS I AAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVMMTLTNWYKPGETRK^ 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA. 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGG ATGA ACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGT T TGT CACCT TTGACC T CT T AT T CGTAACAT T AC T GT GGATAATAGAGT TAAATG 
T GAATGGAGGCAT T GAGAACACAT TAGAGAAGGAGGT GAT GCAGTAT GAC TACTAT TC T TCAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
M? AC TGCGCCAT TGGTGGGCAATAGCGT T GACAACGGCAG TGAC CAGTGCCT T T T TAC TAGCAAAAG 

Q TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
J*? C T TGCC TGGAT TGAGACGT GGTTCC TG GAT T TCAAAGT GT TACCTCAAGAAGCAGAAGAAGAAAA 

i ' j CAGAC TCC T GATAGT T CAGGATGC T TCAGAGAGGGCAGCAC TTATAC C T GGTGGTCT T TCT GAT G 

jjl GTCAGT TT TAT TCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 

111 GAGAAACCACT TT TAGAACTATGAGTACTACTT TTGT TAAATGTGAAAAACCCTCACAGAAAGTC 

ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 

*p. CACTGCT GGCT T TAT T GAACAGC TAATAAAGAT T TAT T TAT T GTAATACC T CACAAACGTT GT AC 

W 

; f| CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 

■"•1? 5 

Q TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
rU AAAAGACT TAATATAT TGAAGTAACACTTT TTTAGTAAGCAAGATACCTTT TTAT TTCAATTCAC 

AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 

ru 

TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T T GC AAAAG GG GAAGAAAGGAAT T GC GAAT ACATGT AAAAT GT CAC CAGAC AT T T GT AT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
AC AAAAT GAC T T AAAC CAT T CAT AT CAT GTTTCCTTTGCGTT C AGC C AAT T T C AAT T AAAAT G AA 
CTAAAT TAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLIWIHSINPTQIMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNWGGIENTLEKEVMQYDYYSSYFDIFLIAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTS AFLLAKVI LS KL FSQGAFGYVLPI I S FI LAWI E T W FLDFKVLPQEAEEENRLL I VQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCT^AGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
^ GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
□ TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
Q CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
rfi TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
W TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
Ml CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
[|| CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
: y TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
^ CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
* ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCC TGTAC TGT GAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCC TACACCAGT CACGCC GAG 
Ljl GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
£*J AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
1* GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
Q GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
I'lj CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC TGTGTGAC TCTAG T CT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGAT GC 
T T TAGGAACATG TT T T GC T T TT T TAAAATAT ATAT AT T T ATAAGAGAT CC T T TC CCAT T TAT T CT 
GGGAAGATGTTT TTCAAACT CAGAGACAAGGACTTTGGTT TT TGTAAGACAAACGATGATATGAA 
GGCC TTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 



APP_ID= 



10063557 



Page 165 of 318 



FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIAALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQT^REGNLPLLIAPALAAV 
LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
AT TAGAAT CCTCTAT T CAAGAAGAGGAAGACAGCC T CAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGATAT CAGCT T TC TAGAGTC T CCAAAT CCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T TCC TAGATAAGGAGT AT GAT GAAT GTACAT CAGAT GGGAGGGAAGAT GGCAGACTGT GGT GTG 
Q C TACAACC T AT GACTACAAAGCAGAT GAAAAGT GGGGCT T T TGTGAAACT GAAGAAGAGGCT GCT 

C3 AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
j3'j CAATAAGAAAAGCCAAAAAAGAGAAGCATAT CGGTAT CT C CAAAAGGCAGCAAGCAT GAACCATA 

Mf CCAAAGCC CT GGAGAGAG TGTCATAT GCT CT T T TAT T T GG T GAT TAC T T GCCACAGAATATCCAG 

lit 

fn GCAGCGAGAGAGATGT TT GAGAAGC TGACT GAGGAAGGCT CT CC CAAGGGACAGAC T GC T CT TGG 

%l CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
n T T GGAGC T C T TGGGGGCAAT C TAATAGCCCACAT GGT T T T GGTAAGTAGAC T T TAGTGGAAGGCT 

AATAATAT TAACATCAGAAGAAT T TGT GG T TTATAGCGGC CACAACT TT TT CAGC TTT CAT GAT C 

I si 

^ CAGAT T T GC TT GTAT TAAGACCAAATATT CAGT T GAAC TT CCT T CAAAT TCT TGT TAAT GGAT AT 

f?j'| AACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACAATTTTTCTTTAAAATGATTAG 
Q TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
j ! U GAAT TGT CAGAAT CAT T T T T TAC AT T AGAT TAT C AT AAT T T T AAAAAT T T T T C T T TAG T T T T T CA 

AAAT T T TGTAAAT GGTGGC TAT AGAAAAACAACATGAAATAT TAT ACAATAT TT TGCAACAATGC 
CC TAAGAAT TGT TAAAAT T CAT GGAGT TAT T TG TGCAGAAT GAC TCCAGAGAGC T CTAC T TT CTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCATTTT TAATAAAATTATGTCTAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRVVAGQIFLDSEESEL 
E S S I QEEEDSLKS QEGE SVTEDI S FLES PNPENKDYEEPKKVRKPALTAI EGTAHGE PCHFPFLFLDK 
E YDECTSDGREDGRLWCAT T YDYKADEKWGFCETEEEAAKRRQMQEAEMMYQT GMKI LNGSNKKS QKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALV YYT FGALGGNL I AHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose -bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT TCAGAT T T TAAGCCCAT TC T GCAGT GGAAT T TCAT GAACTAGCAAGAGGACACCAT CT T CT T 
G TAT T AT AC AAGAAAG GAG T G T ACC TATC ACAC AC AGGGG G AAAA ATG C TCTTTTGGGTGC TAG G 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGT ACAT T T T TATCAC TGGATGTGACT CGGGCT T TGGAAAC T TGGCAGCCAGAACT TT T GAT 
AAAAAGGGAT T T CATGT AATCGCTGCCT GT CT GACT GAATCAGGAT CAACAGCT T TAAAGGCAGA 
AACC TCAGAGAGAC T T CGTAC TGTGCTTCT GGATGT GACC GACCCAGAGAAT GTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
M; CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
£13 CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
s ; , TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
Ml ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
Ul TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
N AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

?S T T T T C T GGAT AC CTCT GT CT CACATGCCAGCAGC TT T GCAAGAC T T T T TAT T GT TGAAACAGAAA 

Ml 

Q GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



Q 

ru 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENWRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTL1SMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKIA^ 

LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGAT GATGTTGAAGT T GTTTATACAAT TGA 
CAT TCAGAAATATAT T CCAT GC TATCAGCT TT T TAGCT T T T ATAATT CT TCAGGCGAAGTAAATG 
AGCAAGCACT GAAGAAAATAT TATCAAAT GTCAAAAAGAAT GT GGTAGGT T GGTACAAAT TCCGT 
CGTCAT T CAGAT CAGATCAT GAC GT T TAGAGAGAGGCT GCTT CACAAAAACT TGCAGGAGCATT T 
T T CAAACCAAGACCT TGTTTTTCTGC TAT T AACACC AAG T AT AATAACAGAAAGC TGCT CT AC T C 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
Q GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
□ T T T TAGCCGAGCAGTACAAACACACAGCT CT AAAT T T TT T GAAGAAGATGGATCC T TAAAGGAGG 

M ! TACATAAGATAAATGAAAT G TAT GCTT CATTACAAGAGGAAT TAAAGAGTATAT GCAAAAAAGTG 

• i \ 

TjS GAAGACAG T GAAC AAG CAG T AGAT AAAC TAG TAAAGGAT G T AAAC AGAT T AAAAC GAGAAAT T GA 

If} GAAAAGGAGAGGAGCACAGATT CAGGCAGCAAGAGAGAAGAACAT CCAAAAAGACCC T CAGGAGA 

^4 ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
^ ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
*f AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
**k AAAT CAT T AAGCATAAAGCCT TAGACT TAGATGACAGAT GGCAAT TCAAGAGAT C TCGGTT GT TA 

fjj GATACACAAGACAAACGAT CTAAAGCAAAT AC TGGTAGTAGTAACCAAGATAAAGCAT CCAAAAT 

■53 GAGCAGCC CAGAAACAGAT GAAGAAAT T GAAAAGAT GAAGGGT T T TGGT GAATAT T CACGGT CTC 

1 ^ CTACAT TTTGATCCT TT TAACCT TACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAAC 

ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACTCTTAGTGCATTTACTT CACAAAGTACTTTT TCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGAT TGAAGC TAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTATTTTCAAAATATGGAAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGAIAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PC YQL FS FYNS S GE VNEQALKK I LSNVKKNWGWYK FRRHS DQ IMT FRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features : 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCT C CAC GAGAT CAACGGT GAT CATTT GAAGAT CT GT CCCCAG GGT T CT ACCT GCT GCT CT 
r 5 * CAAGAGAT GGAGGAGAAGTACAGC CT GCAAAGTAAAGAT GATTT CAAAAGT GT GGT CAGC GAACAGTG CAATCAT TTG 

Ql CAAGCT GTCTTTGCTT CACGT TACAAGAAGTTT GAT GAAT T CTT CAAAGAACTACTTGAAAAT GC AGAGAAAT CCCT G 

■■><4 AAT GATAT GTT TGTGAAGACATAT GGCCAT TTATACAT GCAAAATT CT GAGCTATT TAAAGAT CT CTT C GT AGAGTTG 

M'l AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 

Mj ttccgcctggtgaactcccagtaccactttacagatgagtatctggaatgtgtgagcaagtatacggagcagctgaag 

Ms cccttcggagatgtccctcgcaaattgaagctccaggttactcgtgcttttgtagcagcccgtactttcgctcaaggc 

Mf ttagcggttgcgggagatgtcgtgagcaaggtctccgtggtaaaccccacagcccagtgtacccatgccctgttgaag 

'M atgatctactgctcccactgccggggtctcgtgactgtgaagccatgttacaactactgctcaaacatcatgagaggc 

* tgtttggccaaccaaggggatctcgattttgaatggaacaatttcatagatgctatgctgatggtggcagagaggcta 

^ gagggtcctttcaacattgaatcggtcatggatcccatcgatgtgaagatttctgatgctattatgaacatgcaggat 

i ft 

aatagtgttcaagtgtctcagaaggttttccagggatgtggaccccccaagcccctcccagctggacgaatttctcgt 
tccatctctgaaagtgccttcagtgctcgcttcagaccacatcaccccgaggaacgcccaaccacagcagctggcact 

* M agtttggaccgactggttactgatgtcaaggagaaact gaaacaggccaagaaattctggtcctcccttccgagcaac 

= : - ;:: h 

5«s gtttgcaacgatgagaggatggctgcaggaaacggcaatgaggatgactgttggaatgggaaaggcaaaagcaggtac 
i M ctgtttgcagtgacaggaaatggattagccaaccagggc^caacccagaggtccaggttgacaccagcaa^ 

atactgatccttcgtcaaatcatggctcttcgagtgatgaccagcaagatgaagaatgcatacaatgggaacgacgtg 
gacttctttgatatcagtgatgaaagtagtggagaaggaagtggaagtggctgtgagtatcagcagtgcccttcagag 
tttgactacaatgccactgaccatgctgggaagagtgccaatgagaaagccgacagtgctggtgtccgtcctggggca 
caggcctacctcctcactgtcttctgcatcttgttcctggttatgcagagagagtggagataattctcaaactctgag 
aaaaagtgttcatcaaaaagttaaaaggcaccagttatcacttttctaccatcctagtgactttgctttttaaatgaa 
t g gacaacaat gtacagtttt tact atgt gg ccact ggtt taagaagt gct gactttgttttct c at t cagt t ttggg 
aggaaaagggactgtgcattgagttggttcctgctcccccaaaccatgttaaacgtggctaacagtgtaggtacagaa 
ctatagttagttgtgcatttgtgattttatcactctattatttgtttgtatgtttttttctcatttcgtttgtgggtt 
tttttttccaactgtgatctcgccttgtttcttacaagcaaaccagggtcccttcttggcacgtaacatgtacgtatt 
tctgaaatattaaatagctgtac^gaagcaggttttatttatcatgttatcttattaaaagaaaaagcct^w^aagc 
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FIGURE 24. 

MARFGLPALLCTIAVLSAALIAAELKSKSCSEWRLWSK^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLWSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFID^ 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGIiANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KS ANEKADS AGVRPGAQAYLLTVFC I LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site, 

amino acids 514-518 

Glycosaminoglycan attachment sites ♦ 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCT GATGTCCATGGT CT CTAGCAGCCT GAAT CCAGGGGT CGCCAGAGGC CACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
h* GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
Q AGCGCCCACTCT TCCAAT TAAACAT T C TCAGCCAAGAAGACAGTGAGCACACC TACCAGACACT C 

^ TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
; : !J TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
[|I GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
=j1 AGCTAGT GT CATT TAACC T TAAATGCAATCAGGAAAGT AGCAAACAGAAGT CAATAAATAT T T T T 

AAATGTCAAAAAAAAAAAAAAAAAA 

U1 

ry 
ru 
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FIGURE 26 

MKVL I SS LLLLLPLMLMSMVS S SLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



Important features: 
Signal peptide: 

amino acids 1-22 



N-myristoylation sites. 

! >J amino acids 27-33, 46-52 

□ 
w 

Ul 

in 

0 
in 
a 
m 
a 
ry 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACC T GGAAACACAT TGGGGAT GGCTGCTGCCTCACCAGAGAGACCT GGAAGGATC T T GA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATC TCTAACT TCAAT CCCGCC T TCGACAG T GAA&AAGCT C TACT TC TACGC TGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGT TTTGT TCTCAAATATT T T TTAATAAATAGACGAA 
ACCACG 



FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTG7\NFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KH I GDGCCLTRE TWKDLENAQFSE I QMERQPPPLKWLPVGPHIMGKAVKQS FPS S KAL I CS FPS L 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 20 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATG CTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGT TT ATGAT T CT TATACTAAT T T AT ACAAAGATAT TAAGGCCC T GT TCAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAPlAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



kik Growth factor and cytokines receptors family: 

=:;*J amino acids 3-18 

Q 

m 
w 

in 

0 

in 
ru 
ry 
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FIGURE fti 



G T T T GAAT T CC T TCAACTATACC CACAGT CCAAAAGCAGAC TCAC TGT GTC C CAGGC TACCAGT T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
T TCAGAT TCAT CACCACC CT T C TGG T TCACAT T T TCATT T CAT T GGT TAT T T T GGGATTGT T GTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
M !i CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
W AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
5j| TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
i.fl AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
kU T GGT TAC T T GT TAT T TCAACAGAAGTAAAAAT GATCC TCC T GATCAT CCCAT CCTTTCGTCTCTC 

TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAATCAT T GT CATG TACAT GCAAAACGCACT GAAAGAACAGCAGCATGGTGCAT TGT 
: V| CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
Q AACC AGAAT GC ATATACT ACAACT GC TAT T AAT GGGACAGAT T TC T GT AC AT CAGC AAAAGAT GC 

fit AT TCAAAATCTTGTCCAAGAACTCAAGTCACTT TACATCTATTAACTGCTT TGGAGACTTCATAA 

$ TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 

ril 

AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAAC T CCAGGCCAT TGTGAGATAGATACCCATT TAGGTAT C TGTACC TGGAAAACAT T 
TCCT TC TAAGAGCCAT T TACAGAATAGAAGATGAGACCAC TAGAGAAAAGT TAGT GAAT T T T TT T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MS GRDT I L GLC I LALALS LAMMFT FRFI T T LLVHI F I SLVI LGLL FVCGVLWWL Y YD YTNDLS I E 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
IL I FFWLWAVLLSLGTAGAAQVMEGGQVEYKPLSG IRYMWS YHLIGLIWTSE FILACQQMTIA 
GAWTCYFNRSKNDPPDHP I LS S LS I LFFYHQGTWKGS FL I S WRI PR 1 1 VMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLE TNDGS SEKPY FMDQE FLS FVKRSNKLNNARAQQDKHS LRNEEGTE LQAI VR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristovlation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3ft 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT TT GTAT GGGAT T TGTGAT GCAGGAAAGCC TAAGGGAAAAAGAAT AT TCAT TC T GTG T GGT 
GAAAAT T T T TT GAAAAAAAAAT T GCC T TC T T CAAACAAGGGTGTCAT T CT GATAT TTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGT CAAAGCCGGAAAGATCAT CGAT CCT GAGT TCATTGT GAAAT GT CCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGT TAT TCCAACGGTGT C CAATCGT TAT CCC TACCACGAT GGAGAGAAT CCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATAC TCAT CAT 
C GAAAAGT CCAGC TGCCCAAGCAGGT GAGACCACAAAAGCCTAT CAGAGGCCACC TAT TCCAGGG 
y« ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
^ CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
"IS GCCACAGGAGCCAGGAGAT GGATCT CTGG T CCACTGC CACC TACACAAGCAGCCAAAACAGGCCC 

AGAGC TGAT CCAGGTATCCAAAGGCAAGAT CCT TCAGGAGC TGCC TT CCAGAAACCTGT T GGAGC 
CP GGAT GT CAGCC TGGGACT T GTT CCAAAAGAAGAAT TGAGCACACAGT CT T T GGAGCCAGT AT CCC 

y TGGGAGAT CCAAACT GCAAAAT TGAC TTGTCGTTTT TAAT T GAT GGGAGCACCAGCAT TGGCAAA 

\'fi CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
; M J CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
^- ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTT TCTAAT 

H GTAGGTCGGGCCATCTCCT TTGTGACCAAGAACT TCTTTT CCAAAGCCAAT GGAAACAGAAGCGG 

;:; GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
m GACT T GCGAGAGAGT CAGGAATCAACATT T T C T T CAT CACCAT TGAAGGT GCTGC TGAAAAT GAG 

?% AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
Ml GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
Kv4 ACAC TGACCGCC TGGCCT GCAGCAAGACCTGC TTGAACTCGGCTGACATT GGCT TCGTCAT CGAC 

f-y GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTT T GAGAT T TCC GACACGGACACGCGCATC GGGGCCG T GCAGTACACCTACGAACAGCGGC 
T GGAGT T T GG G T T CGACAAG TACAG C AGCAAGC C TGAC AT C CT C AAC G C CAT CAAGAGG G T G G GC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GT CCAAGCCCAACAAGAGGAAGT TAATGAT CCTCAT CACCGACGGGAGGT C CTACGAC GACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGC TAGAAGT CAT TGCCAC T CACCCCGCCAGAGACCACTCCT TC T T TGT GGACGA 
GT TT GACAACC T CCAT CAGT AT GTCCCCAGGATCAT CCAGAACAT TT GTACAGAGT TCAAC T CAC 
AGCC TCGGAAC TGAAT T CAGAGCAGGCAGAGCACCAGCAAGTGCT GC T T TACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACAT 
TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T GTCAT GACAATGTAGGAAT TGCT GAAT TAAATGTT TAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAA?y\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAZU^^AAG 
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FIGURE 24 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVS LGLVPKEELS TQS LE P VSLGDPNCKI DLS FL I DGS TS I GKRRFRI QKQLLADVAQALD I 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNWVVMVDGWPTDKVEEASRLARESGI^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILITDGRSYDDVRI PAMAAHLKGVITYAI GVAWAAQEELEVIATHPARDHS FF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 



CCGAGC AC AG GAGAT T GCCTGC GT TT AGGAGGTGGCTGCGTT GT GGGAAAAGCT AT CAAGGAAG AAAT TGC 
CAAACC AT GT CT TT TT TT CTGT TT TC AGAGT AGT T C AC AACAGATCTGAGT GTT TT AAT T AAGC AT GG AAT 
AC AG AAAACAAC AAAAAACTT AAGCTTT AATT T C AT CT GGAAT T CC AC AGTTT T CT T AGC TCCCT GGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
GTCAGC CT TCCCC ACT AC AAT GTGATAGAACGCGTG AACT GGAT GT AC TTCT AT GAGT AT GAGCCG AT TT A 
CAGACAAGACTT TC AC TT CACACT T CG AGAGC AT T C AAAC TGCT CTC AT C AAAATC C AT T TCTGGT CATTC 
N 5 TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
M TGGT GGGG AT ATGAGGTT C T TACATTT T TCTT AT T AGGCC AAGAGGCT GAAAAGGAAGAC AAAAT GTT GGC 



Ml 



ATTGTCCTT AGAGGAT GAAC ACCTTCT T TATGGT GACAT AAT CCGACAAGAT TT TT T AGACAC AT ATAAT A 
ACCT GACCT T GAAAAC CATT AT GGCAT T CAGGTGGGT AAC T G AG TT TT GCCC C AATGCCAAGT ACGT AATG 
AAGACAGAC ACTGAT GTT TTCATCAAT ACT GGCAAT TT AGTGAAGTAT CT TT T AAACCT AAACC ACTC AG A 
\f{ GAAGTT TTT C AC AGGT TATCCT CT AATT GATAAT T ATT CC T AT AGAGGAT TT TACCAAAAAACCCATATTT 

\j CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
s; TTGGTGCC AAGG ATCT AT GAAATG AT GGGT CACGT AAAAC CC AT CAAGTTT G AAGATGTT T ATGT C GGGAT 

K! CTGT TTGAAT TT ATT AAAAGTG AACATTCATAT T CC AG AAGACACAAATCTT TT CT TT CT AT AT AG AATCC 

s w 

Mi AT T T GGAT GT CT GTC AACTGAGAC GTGT GAT TGC AGCCC ATGGCTT TT CTT CC AAGGAGATC AT CACTTTT 

Sf m TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 

ijf GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 

J • "S 

ACACTGAACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTCAGG 
CCCT TC AAAG AT GAT ATGT G GAGGAAT T AAAT AT AAAGGAAT TGGAGGTT T T TGCT AAAG AAAT TAAT AGG 
AC CAAACAAT TT GGAC AT GT CATT CTGT AGACTAGAAT T T CT TAAAAGGGT GTT ACTG AGTT AT AAGCT CA 
CT AGGC TGT AAAAACAAAACAAT GT AGAGTTTT ATT T ATT GAACAATGT AGT CACT TGAAGGTT T TGT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CT GAAC AAAATT TT AC CT GTT T TT GGT C AT TT AT AAAGT ACT TC AAGATGT T GC AGT ATT TC AC AGTT ATT 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
TG AATC AT TCT T TACATGCAAAC ATTTT CCAGTT AC TT AACT GATCAGTTT ATT ATT GAT ACAT CACT CCA 
TTAATGT AAAGT C ATAGGTC AT T ATT GC AT ATC AGT AATCTCT TGGACTTT GTT AAAT AT T T TACT GTGGT 
AAT AT AGAG AAGAATT AAAGCAAGAAAAT C TGAAAA 
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FIGURE 26 



MASALWTVLPSRMSLRSLKWSLLLLSLL 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I KFEDVYVG I CLNLLKVNI H I PEDTNL FFLYRI HLDVCQLRRVI AAHG FS SKE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCC AAGAGAGT AC AC AGTCAT TAATGAAGCCT GC C C TGGAGCAGAGTGGAATATCAT GTG TCGGGAGTGC TGTG 
AAT ATGATCAGAT TGAG TGCGTCTGCCCC GGAAAGAGGGAAGT C GTGGGTTATAC CATCCC TT GC TGC AGGAATGAGGAGAA 
TGAGTGT GAC TC C TGCC TGATCCACCCAGGT TGTACCATCT T T GAAAAC TGC AAGAGC TGC CGAAATGGCTC ATGGGGGGGT 
ACC TTGGAT GAC T TC TATGTGAAGGGGT TCTAC TGT GCAGAGT GC CGAGCAGGCTGGTACGGAGGAGAC TGC ATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATG^ACC^TTCATGC 
TAAACCTGGGTT T GT CATCCAAC T AAGAT TT GT CATGTTGAGTC TGGAGT T T GAC TACATGTGCCAGTATGAC TATGTTGAG 
G TTCGTGATGGAGACAACC GC GATGGCC AGATC ATCAAGC GTGTC TGTGGCAACGAGCGGCCAGC TCC TATC C AGAGCAT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 

y_ ATGCTCC TCATC CCC T TGTT T CCATGACGGCACGT GCGTC C T TGACAAGGCT GGAT C TTACAAG TGTGCCT GC TTGGCAGGC 

*s TAT AC TGGGCAGCGC T GTGAAAATC TCC T TGAAGAAAGAAAC TGC TC AGACCC TGGGGGCC CAGTCAATGGGTACC AGAAAA 

P"?g TAACAGGGGGCC C TGGGCT TAT CAACGGACGCC AT GC TAAAAT TGGC ACCGTGGT GTCTTTCTTT TGTAACAACTC C TATGT 



T CT TAGTGGCAATGAGAAAAGAAC TTGCCAGCAGAATGGAGAG TGGTC AGGGAAAC AGC CCATC TGC ATAAAAGCC T GC C GA 
GAACCAAAGATT TCAGACC TGGTGAGAAGGAGAGTTC TTCC GATGCAGGTTCAGT C AAGGGAGACACCAT TACAC CAGC TAT 
ACT CAGCGGCC T TCAGC AAGCAGAAAC TGCAGAGTGCC CC TACCAAGAAGC C AGC CC TTCC C T T TGGAGATC T GCCC ATGGG 

atacc aacatc t gcatacccagctccagtat gagtgcatc t cacccttc taccgccgcc tgggcagcagcaggaggacatgt 
ctgaggactgggaagtggagtgggcgggcaccatcctgcatccctatctg<:gggaaaattgagaacatcactgctccaaaga 
ccc aagggttgcgc tggccgt ggcaggcagccatc tacaggaggaccagcggggtgc atgacggcagcc tacacaagggagc 
gtggttcctagtctgcagcggtgccctggtgaatgagcgcactgtggtggtggctgcccactgtgttactgacctggggaag 

GTC ACCAT GATCAAGAC AGCAGACC TGAAAGT TGT T TTGGGGAAATTC TAC C GGGAT GATGACCGGGAT GAGAAGACC ATCC 
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT 
C CTAGAC AAGGCCCGTATCAGCACCCGAGTC CAGC C C ATCTGC C TCGC TGCCAGT CGGGATC TC AGCAC T TCC TTC CAGGAG 
TCCCACATCACTGTGGCTGGCTGGAA.TGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
; - 4 TGGTCAGTGTGGTGGACTCGC TGC TGTGTGAGGAGCAGCAT GAGGACCATGGCATCC CAGTGAGTGTC AC T GATAACATGTT 

CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGC ATC TCCT GAGCCACGC TGGC ATC TGATGGGACTGGT CAGC TGGAGCTATGAT AAAACATGCAGCCACAGGC TC TCCA 
CTGCC TTCACCAAGGTGC TGCCTT T TAAAGACT GGAT TGAAAGAAATATGAAAT G&ACC AT GC TCATGCAC TC CT TGAGAAG 
TGTT TCTGTATAT C C GT C TGTACG TGTGTCATT GC G TGAAGCAGTGT GGGCC T GAAGTGTGAT T TGGCC TGTGAAC T TGGCT 
GTGCCAGGGCT T C T GAC T TCAGGGACAAAAC T C AGTGAAGGGTGAGTAGACC T CC AT TGCTGGTAGGC TGATGCC GC GTC CA 
C TAC TAGGACAGC CAAT TGGAAGATGCC AGGGC T TGCAAGAAGT AAGT T TCT T CAAAGAAGACC ATAT ACAAAAC C TC TCCA 
CTCCACTGACC TGGT GGTC T T CCCCAAC T TT C AGT TATACGAATGCC ATCAGC T TGACCAGGGAAGATC TGGGCT TCATGAG 
GCCCC TT TTGAGGCT CT CAAGT TC TAGAGAGC TGC C TGTGGGAC AGCC CAGGGCAG CAGAGCTGGGATGTGGT GCAT GCC TT 
T GTGTACATGGCCACAGTACAGTC TGGTC CT T T TCC TTCCC CATC TC T TGTACACAT TT TAATAAAATAAGGGTT GGCTTCT 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



ry 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLES YPLNAHCEWT IHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQI I 
KRVCGNERPAP I Q S I GS S LHVL FHS DGSKNFDGFHAI YEE I TAC S S S PC FHDGTC VLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAI YRRTSGVHDGSLHKGAWFLVCS GAL WERT VWAAHCVTDLGKVTMIKTADL 
13 KWLGKFYRDDDRDEKT I QS LQI SAI I LHPNYDP I LLDAD I AILKLLDKARI S TRVQPI CLAAS R 

JJ* DLSTSFQESHITVAGWNVIiADVRSPGFKNDTLRSGWSVVDSLLCEEQHEDHGIPVSVTDNMFCA 
"Jj SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
[fl ERNMK 

n 

H Important features of the protein: 

Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 2 60-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 
CUB domain proteins profile. 

amino acids 150-167 
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FIGURE so 



GGTT C CTACAT CCTCT CAT CT GAGAATCAGAGAG CATAAT CT T CTT ACGGG C C CGT GAT TTAT TAAC GT GGCTTAAT C 
T GAAGGTT CT CAGT CAAATT CTTT GT GAT CTACT GATT GT GGGGGC AT GGCAAGGT TT GCT TAAAGGAGCTTGGCT GG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGA ATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAG GT GCCT CC CACTAC GGC CTGACCAAAGATAGGAAGAGGCGCTCACAAGAT GGCTGT C CAGAC GGCT GT G C GAGCC 
T CACAGC CACG GCT CCCT CC CCAGAGGT TTCTGCAGCT G CCAC CAT CT C CTTAAT GACAGACGAGC CTGGCCTAGACA 
ACC C T GCC TAC GT GT C CT CG GCAGAGGAC GGGCAGCCAG CAAT CAGCCCAGT GGACTCT GGCCGGAGCAACCGAACTA 
GGG CACG GCCCTTT GAGAGAT CCACTAT TAGAAGCAGAT CAT TTAAAAAAATAAAT CGAGCTT T GAGT GT T CTT CGAA 
GGACAAAGAGCGGGAGTGCAGTT GCCAAC CATGC CGACCAGGGCAGGGAAAAT T CT GAAAACACCACT GC CC CT GAAG 
TCTTTCCAAGGTTGTACC^CCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 
h d GCCTCT CTATTAGGCT GGT G GGAGGT AGC GAAAC CC CACT GGT CCATATCATT AT C CAACACATTTAT C GT GAT GGGG 

T GAT CGCCAGAGACGGCCGGCTACT GCCAGGAGACAT CAT T CTAAAGGTCAACGGGAT GGACAT CAGCAAT GT C CCTC 
M a ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
' Z 'M GCAG GAACAAT GGACAGGCC CCGGAT GC CTACAGAC CCCGAGAT GACAGCTTT CAT GT GAT TCT CAACAAAAGTAGCC 

CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
^ ! J GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATC^TGGACATGATCTTCGATATGGCA 
^ GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
§ GGAG CCCT GACAT CTTTCAGGAAGCCGG CT GGAACAGCAATGG CAGCT GGT CC CCAGGGC CAGGGGAGAGGAG CAACA 

M CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 

in 

GCATGACC GT CGCAGGGGGAG CATCACATAGAGAAT GGGAT TTGCCTAT CTAT GT CATCAGTGTT GAGCCCGGAGGAG 
•;i « T CATAAGCAGAGAT GGAAGAATAAAAACAGGTGACATT TT GTT GAATGT GGAT GGGGT CGAACT GACAGAGGT CAGCC 

* ; M GGAGT GAG GCAGT GGCAT TATTGAAAAGAACAT CAT CCT CGATAGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 

W CCCAGGAAGACT GCAGCAGC CCAGCAGCC CTGGACTCCAAC CACAACAT GGCC CCACCCAGT GACT GGT C CC CAT CCT 

■ GGGT CATGT GGCT GGAATTACCAC G GT GCT T GTAT AACT GTAAAGATATTGTAT TACGAAGAAACACAGCT GGAAGTC 

T GGGCT TCT GCATT GT AGGAGGT TAT GAAGAATACAAT GGAAACAAACCT TTTT TCAT CAAAT CCATT GTT GAAG GAA 
CAC CAGCATACAAT GATG GAAGAATTAGAT GTG GT GATATT CTT CT TGCT GT CAAT GGT AGAAGTACAT CAGGAAT GA 
TACAT GCTT GCTTG GCAAGACTGCT GAAAGAACT TAAAGGAAGAAT TACT CTAACTATT GT TTCTT GGC CT GGCACTT 
TTTT ATAGA AT CAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAAT AGGCTAAGAAGTT GAAACACTATAT TTAT C 
TTGT CAGTTTT TATATTTAAAGAAAGAATACATT GTAAAAATGT CAGGAAAAGTAT GAT CATCTAAT GAAAGCCAGTT 
ACAC CT CAGAAAAT AT GATT CCAAAAAAATTAAAACTACTAGTT TTTTT T CAGT GT GGAGGAT T T CTCATTACT CT AC 
AACATT GTTTATAT TT TTT C TATT CAATAAAAAG CCCT AAAACAACTAAAATGATT GAT TT GTATACCCCACT GAATT 
CAAGCTGAT TTAAATT TAAAATTTGGTATAT GCT GAAGT CT GC CAAGGGTACAT TATGGC CAT T TTTAATTTACAGCT 
AAAAT ATTTTTTAAAATGCAT TG CT GAGAAACGTT GCTTT CAT CAAACAAGAATAAATATT TT T CAGAAGTTAAA 
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FIGURE 4Q 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDE PGLDNPAYVS SAE DGQPAI S PVDSGRSNRTRARPFERS T IRSRS FKKINR 
ALS VLRRTKS GS AVANHADQGRENS ENTTAPE VFPRL YHL I PDGE I TS I KINRVDPS E S LS IRL V 
GGSETPLVH III QHI YRDGVI ARDGRLLPGD I I LKWGMDI SNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGS PE S AAHL I QASERRVHLWSRQVRQRS PD I FQEAGWNSNGS WS PG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N -myr i s toyl a ti on s i tes . 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGT ATCT T CAGTTGTCATCAAGT T CGCAAT CAGATT GGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACT GTCCTT CTGTGT CTCCTTACCT CAGAGGCCAGAGCAAGCTCATTT T CAAACCAGATCT CAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
m CATCTGCATCCCT TCCT GCAGAGGCAGCAGCT GGAT TAT GGCATCTACGTCAT CCACCAGGCTGAAGG 

Q TAAAAAGTTTAATCGAGCCAAACT CTTGAATGT GGGCTATCTAGAAGC CCT CAAGGAAGAAAAT T GGG 

ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
"MJ CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
^ GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
'fl GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 

Hi 

r;J GAT GAAGC TCT TACACCAAGTGT CACGAGTCT GGAGAACAGATGGGTTGAGTAGTTGTT CTTATAAAT 

Ijl TAGTATCT GTGGAACACAATCCT T TAT AT AT CAACAT CACAGTGGATTT CTGGTTTGGT GCATGACCC 

□ TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
^jj ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
if! T AGCAGAGCTCCTGGT GAT GTAGAGTATAAAACAGT TGTAACAAGACAGCTTT CT TAGTCATT TTGAT 

CAT GAGGGTTAAATAT T GTAATATGGATACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 
AAT GAAC GCTAT T TGAGGACT CT GGTT GAAGGAGAT T TAT TTAAATT T GAAGTAATATATT ATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTAT TCAT TTATCCT GTACAAT CATCTGTGAAGTGGTGGT GT CAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAAT CAGGACACAGTGAACT T GGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGT TTTT GTAAAATGAT TTTGTACAAGTAGGATAT GAAT TAGCAGTTTACAAGTT T ACAT 
ATTAACTAATAATAAATAT GT CTAT CAAATACCTCT GTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMYFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNPLY INI T VDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAA?U^AAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44. 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 



: 4 
I* % 

m 

in 

m 



m 



Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 



M amino acids 1-12 
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FIGURE A* 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TTCAACACAACCCCT CT TGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAAT GACT CAGGGATC TAC TATGT GGGGATATACAGCT CAT CACT CCAGCAGCCC TCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
%] AT AAGAAT G GCACC TGTG TGACCAAT CTGACATGCTGCATGGAACAT GGGGAAGAGGAT GT GATT 

m TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
SI C TGGAGATGGGGAGAAAGTGATATGACCT TCATCTGCGTT GCCAGGAACCCTGTCAGCAGAAACT 

W TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 

GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTT TCTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGT GGACAT T TGT CGGG 
AAAC TCCTAACATATGCCCCCAT TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATC CTAAAGGAAGATCCAGCAAATACGGT TT ACT CCAC TGT GGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
T CTAGACAGCAG T GCACT C CCC TAAGT CT C T GC TCA 
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FIGURE 46 



mGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNP VSRNFS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLLS LFVLGL FLWFLKRERQEEY I E 
EKKRVDICRETPNI CPHSGENTEYDTI PHTNRT I LKEDPANTVYSTVE I PKKMENPHSLLTMPDT 
PRLFAYENVI 



Important features: 



Signal peptide: 



amino acids 1-22 



yi 



Transmembrane domain: 



amino acids 224-250 




N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 



Leucine zipper pattern. 



amino acids 229-251 



291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAAT GCCAAT T GT GAAT T TT CATT GAAAAACATCAGTGACAT T CAT CCAGAATCCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T T T AAT GGGAATAAAAT G T AAG T AT CAG TAG T T T GAAAAAAAAAAA 



X 

m 
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FIGURE d8 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLS S FFS VITVIGALYCML I S I QALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 



N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 




amino acids 223-227 



: ;:i\4 

a 



N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 207-218 



TNFR/NGFR family cysteine-rich region protein. 



amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGGAAATACAAGAG 
CAGC CAGAAGCAGCACAGT CC T GTACCT GAGAAGGCCAT CC CAC TCATCACTCCAGGCTCT GCCA 
C TAC T TGC TGAG CACAGGACTGGCC TCCAGGGAT GGCC T GAAGCC TAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 



m TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 



MERVTIJyLLLIAGLTALEANDPFMKDDPFYYDWKNLQLSGLICGGLIAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 



1 



Transmembrane domain: 



amino acids 36-59 



01 

w 

U! 

m 



N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 



in 

O 

m 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 



amino acids 54-67 
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FIGURE 51 

GTGGACTCT GAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC TGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCAC TGGGACAAATAT TGGGGAGGCCC TT GGACATGGCCT GGGAGACGCC CT GA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCT TGGCCAAGGGACCAGAGAAGCAGT T GGCACT GGAGT CAGGCAGGT T CCAGGC T T TGGCGC 
V AGC AGAT G C T T T GG GCAACAG GG T C GG GGAAGCAGC C CAT G C T C T GGGAAACAC T GGGCAC GAGA 

J5 TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
rf| GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
yj CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
hfl GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

ni 

; ! ': GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGC CAGCAAC CAGAAT GAAGGG T GC AC GAAT C C C C C AC CAT C T GGC T CAGG T GGAGGC T C CAGC A 
p ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAAC7\AC 
ifl AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
S3 TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

m 

\Z CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
jJjj CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
C T TCAGAGGACAGGGAGT T TCCAGCAACAT GAGGGAAATAAGCAAAGAGGGCAATCGCCT CCT TG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAA7\AAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 

MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 

EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAJiALGNTGHEIGRQAEDVIRHGADAW 

GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 

GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 

SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 

ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 

FNFDT FWKNFKS KLGF I NWDAI NKDQRS SRI P 

Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 2 65-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATCTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAA.TGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CC T CAGAGGGCAGCAGT CGTC TGGACAT GT TT GAGCACATCAGCC TCATGACCT TGGACAGT CTA 
CAGAAAT GCATC T TCAGC T TT GACAGCCAT TGT CAGGAGAGGCCCAGT GAATATATTGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
g GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
53 AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
Ml CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
jj* GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
?ii AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 



MS LL S LP WLGLRP VAMS PWLLLLL WGS WLLAR I LAW T YAFYNNCRRLQC FPQP PKRNW FWGHLG 
L I TPTEEGLKDS TQMSATYSQGFTWLGPI I PFIVLCHPDT IRS I TNASAAIAPKDNL FIRFLKP 
WLGEGI LLSGGDKWSRHRRMLT PAFHFNI LKS YI T I FNKSANIMLDKWQHLASEGSSRLDMFEH I 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
lAFIPFSAGPRNCIGQAFAMAEMKVVLALMLL^ 
VGLQ 



^1 

Q 
rfl 

w 

I flg Important features : 

[ft Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

P3J Cytochrome P450 cysteine heme-iron ligand signature. 

n amino acids 461-471 



N-glycosylation sites. 

y amino acids 112-116, 168-172 
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FIGURE 55 

AT CGCATCAAT T GGGAGT AC CATC T T CC T C ATG GGACC AGT GAAACAGCTGAAGCGAAT GT T TGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA T T CATGGC CAGT TT TATGAAGC T T T GGAAGGCACT AT GGACAGAAGCT GG TGGACAG T TTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGAT T CGAACAT T TGAGGGTTAC T T T TGGAAGCAACAATACAT T CT CGAAC C T GAAT G T CAGTA 

S3 

% GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
3 TC TGGATGTT GT CCCACT GAAT TCCCATGAATACAAACCTAT T CAGCAACAGCAAAAAAAAAAAA 

\ AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE s6 

MGPVKQLKRMFEPTRLIAT IMVLLCFALTLCSAFWWHNKGLAL I FCILQS LALTWYSLS FI PFAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 



APP ID=10063557 



Page 206 



FIGURE S7 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC ATG CTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGAC CTT GAT GGGCAGCTAGACT TT GAAGAAT TT GT CCATTAT CT CCAAGAT CAT GAGAAGAAG C T GAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAAT T CT CAAGAGCAT GGATAAAAACGGCACGATGAC CAT CGACT G G 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
AT CTTT GATGT GGGT GAGAAT CTAACGGT C CCGGAT GAGT TCACAGT GGAGGAGAG GCAGACG GGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAG CGCCT TGTT GGTAGT GACCAGGAGACT CT GAGGATT CACGAGAGGCT T GT GGCAGGGTCCTT GGCAGGGGC C 
, . ATC GC C CAGAGCAGCATCTACCCAAT GGAGGTCCT GAAGACCC GGAT GGC GCT GCGGAAGACAGGCCAGT ACT CAGGA 

l* w " ATGCT GGACT GCGCCAGGAGGAT CCT GGCCAGAGAGGGGGT GGCCGCCTT CTACAAAGG CTAT GT CCC CAACAT GCT G 

I J GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
■f'f CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
rff TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
V4 CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
yl AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
(pi GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
: '-^| TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
^ ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
fjf TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
C'l CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACT GTT GGGAAAAGGGTTT T GT CCAGAAG GACAAGCCG GACAAAT GAGCGACT TCT GT GCTTCCAGAGGAAGAC GAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
Q| G GGAC CAGCCCCACATT CCACTT GT GT CACT GCTT GGAACCTATTTAT TTT GTATT TATTT GAACAGAGTTAT GT CC T 

.* : i AACTAT TTTTATAGATTT GTTTAATTAATAGCTT GT CATT TTCAAGTT CATTT TTTATT CATATTTAT GTT CAT G GTT 

• 5 ^ GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 

GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CT T CT GCT G C CCTT GCTTAACAAT GC CGGC CAACT GGCGACCT CAC GGTT G CACTT CCATT CCAC CAGAAT GAC CTGA 
T GAGGAAATCT T CAATAGGAT G CAAAGAT CAAT G CAAAAATT GT TAT ATATGAACATATAACT GGAGT C GT CAAAAAG 
CAAATTAAGAAAGAATT G GAC GT TAGAAGTT GT CATTTAAAGCAGCCTTCTAATAAAGTT GTT TCAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNE WRD YHLLHPVEN I PE I ILYWKHS T I FDVGENLT VPDE FT VEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVIMQVHASRSNNM^ 

FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKG YVPNMLG 1 1 P YAG I DLAVYE T LKNAWLQH YAVNSADPGVFVLLACGTMS S TC 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMSSLFKHI LRTEGAFGL YRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT T GGGGAGGAT GGAAT CC T GAGC TGCACT T TT GAACC TGACAT CAAACT TT C TGAT AT 
CGTGAT ACAATGGC TGAAGGAAGGT GT T T T AGGC TTGGTCCAT GAGT T CAAAGAAGGCAAAGAT G 
AGC T G T C G GAGC AGGAT GAAAT G T T C AGAG GC C GGACAG C AGT G T T T GC T GATC AAG T GAT AGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT C AC T T C T AAAG GC AAG G GGAAT G C T AAC C T T GAG T AT AAAAC T GG AG C C T T C AG CAT G C C G G 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAAT CGGAGAT CAAAAGGCGGAG T CACC TACAGC TGC TAAAC TCAAAGGCT T CTCT GT GTG T 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGT TACAACAGGGAT CTACAGAAC TATTTCAC 
CACCAGATATGACCTAGT T TTATAT TT CT GGGAGGAAATGAAT T CATATCTAGAAGTC TGGAGTG 
AG CAAACAAGAG C AAGAAACAAAAAGAAGC C AAAAGCAGAAGGC T C C AAT AT GAACAAGAT AAAT 
C TATC TTCAAAGACATATT AGAAGT TGGGAAAATAATT CATGTGAAC TAGACAAGTGT GT T AAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACC TGGGGAGT GAGAGGACAGGATAGTGCATGT TC T T TGTCTC TGAAT T T T TAG T TAT AT G T GC 
T GTAATGT T GC T C T GAGGAAGCCCC TGGAAAGTC TATCCCAACATAT CCACATC T TATAT TCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCAT T T TAGTAAT GGGT CAAATGATT CAC T T TT TAT GATGC T TCCAAAGGT GCC T TGGCT TC 
TCT TCCCAACT GACAAAT GCCAAAGTT GAGAAAAAT GAT CAT AAT T T T AGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAA7\AAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVSSFFAI SWALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



Lfl 
U1 



in 



TGACGTCAGAATCACC ATG GCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCC T TCCGGGAGAAGGACACAGCTGT ACAAGGCAACATCCGGCT CAGCT T CGAGGAC TTCG T CA 
CCAT GACAGCT TC TCGGAT GCTATGACCCAACCATC T GTGGAGAGTGGAGT GCACCAGGGACCT T 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AAT AGTGAGGACCGGGGCT GAGGCCACACAGATAGGGGCC T GATGGAGGAGAGGATAGAAG T T GA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALTOCmSSFNDETCLMMIlSlMFDKTKSGRIDWGFSALW 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites, 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites . 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTC.TCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGC T GGGGAAATAT TAAT CGG TATC CAGGAGGCAGC TGGGGGAATAT T AAT CGG TATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATC TCCAATAAAATAAAAGCAC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 6a 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 



1*1 Casein kinase II phosphorylation sites. 

Ill amino acids 56-59, 155-158 

in 

N-myristoylation sites. 

m amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

\n 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 



"HI 



Ml 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATGG GGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYSSSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 



N-myristoylation sites. 



j^l N-glycosylation site. 

Kf amino acids 37-41 

m 

m 

[fs amino acids 15-21, 19-25, 60-66 

o 
m 
a 
m 

Q 

m 
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FIGURE 67 



ACGGACCGAGGGT T CGAGGGAGGGACACGGACCAGGAACC TGAGC TAGGT CAAAGAC GC CC GGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCT GCCCC TGAGGTCAAGAGAGGAT GGGGC TAT T CAC TT TTATATAT TTATATAAAAT TAG 
TAGTGAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQI QTT SANENSTVLPS S T S S S S DGNLRPEAI TAI IWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



□ 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 



APP_ID=1 0063557 Page 218 of 318 



FIGURE 6q 



GCCAGGAATAACTAGAGAGGAACAATGG GGT TATT CAGAGGT TT T GT TTTC CT CTTAGT TCT GT GC CT G CT GCACCAG 
T CAAATACTTCCTT CATTAAGCT GAATAATAAT GGCTT T GAAGATATT GTC ATT GTTATAGAT CCTAGT GT GCCAGAA 
GATGAAAAAATAATTGAACAAATAGAGGATATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTC^GTACAAAAGGCCAAAACATGAA 
AAC CATAAACAT GCTGAT GT TATAGTTGCACCAC CT ACACT CC CAGGTAGAGATGAAC CATACACCAAGCAGTT CACA 
GAAT GT GGAGAGAAAGGC GAATACATT CACT TCACC C CT GACCTT CTACTT GGAAAAAAACAAAAT GAATAT GGACCA 
C CAGGCAAACTGTTT GT C CAT GAGT GGGCTCACCT C C GGTGGGGAGT GT TT GAT GAGTAC AAT GAAGAT CAGCCT T T C 
TAC C GT GCTAAGT CAAAAAAAAT CGAAGCAACAAGGTGTTC CGCAGGTATCT CT GGTAGAAATAGAGT T TATAAGT GT 
CAAGGAGGCAGCT GT CTTAGTAGAGCAT GCAGAATT GATTCTACAACAAAACT GT AT GGAAAAGAT TGT CAATT CT TT 
C CT GATAAAGT ACAAACAGAAAAAGCAT CCATAAT GTT TAT GCAAAGTATT GATTCT GT TGTT GAATT TTGTAACGAA 
AAAACC CATAAT CAAGAAGCT C CAAGCCTACAAAACATAAAGT GCAATT TT AGAAGTACAT GGGAG GT GATTAGCAAT 
T CT GAG GATTT TAAAAACAC CATACCCAT GGTGACAC CACCT C CTCCACCT GT CTT CT CATT GCT GAAGAT CAGTCAA 
AGAATT GT GT GCTTAGTT CT T GATAAGT CTGGAAGCAT GGGGGGT AAGGACCGC CTAAATCGAATGAATCAAGCAGCA 
AAACATT T CCT GCT GCAGACT GT T GAAAAT GGAT CCT GGGT GGGGAT GGTT CACT TT GAT AGTACT GCCACTATTGTA 
AATAAG CTAATC CAAATAAAAAGCAGT GATGAAAGAAACACACT CATGGCAGGAT TAC CTACATATC CT CT GGGAGGA 
ACT T C CAT CTGCTCT GGAAT TAAAT AT GCAT TT CAGGT GATT GGAGAGCTACATT CCCAACT CGAT GGATC C GAAGTA 
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
TTTATT GCT TT GG GAAGAGCT G C T GATGAAGCAGTAATAGAGAT GAGCAAGAT AACAGGAGGAAGT CATTTTTAT GTT 
T CAGAT GAAGCT CAGAACAAT G GCCT CATTGAT GCTTT T GGGGCT CTTACATCAGGAAATACT GATCTCTC CCAGAAG 
T C C CTT CAG C T CGAAAGTAAGG GATTAACACTGAATAGTAATGCCT GGATGAAC GACACTGT CATAATT GATAGTAC A 
GTGGGAAAGGACACGT TCT T T CT CAT CACAT GGAACAGT CT GC CT C CCAGTATTT CTCT CT GGGAT C CCAGT GGAACA 
AT AATG GAAAATTT CACAGT GGAT GCAACTT CCAAAAT GGC CTATCTCAGTATTCCAGGAACT GC AAAG GT GGG CACT 
TGGGCATACAAT CT TCAAGC CAAAGCGAAC CCAGAAACATTAACTAT TACAGTAACTT CT CGAGCAGCAAATTCTTCT 
GTGCCT CCAATCACAGTGAAT G CTAAAAT GAATAAGGAC GTAAACAGTTTCCC CAGCCCAAT GATT GTTTAC GCAGAA 
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
TTGGAACTT TTGGATAAT GGT GCAGGCGCTGATT CTTT CAAGAATGATGGAGT CTACT CCAGGTATTTTACAGCATAT 
ACAGAAAAT GGCAGATATAGCTTAAAAGTTC GGGCT CAT GGAGGAGCAAACACT GCCAGGCTAAAAT TACGGCCTCCA 
CT GAATAGAGCC GC GTACATACCAGGCT GGGTAGTGAAC GG GGAAAT T GAAGCAAACCC GCCAAGACCT GAAATTGAT 
GAGGAT ACT CAGAC CAC CTT GGAGGATT T CAGCC GAACAGCATCCGGAGGT GCATT TGT GGTATCACAAGT CCCAAGC 
CTT CC CT T GCCT GACCAATACCCACCAAGT CAAAT CACAGAC CT T GAT GCCACAGT T CATGAGGATAAGATTATT CTT 
ACAT GGACAGCAC CAGGAGATAAT TT TGAT GTT G GAAAAGTT CAAC GTT ATAT CATAAGAATAAGT GCAAGTATT CTT 
GAT CTAAGAGACAGTTTT GAT GAT GCT CTTCAAGTAAATACTACTGAT CT GTCACCAAAGGAGGC CAACT CCAAGGAA 
AGCT TT GCATT TAAACCAGAAAATAT CT CAGAAGAAAAT GCAACCCACATATTT AT TGC CATTAAAAGT AT AGATAAA 
AGCAATTT GACAT CAAAAGTATCCAACATTGCACAAGTAACTT T GTT TAT C CCTCAAGCAAAT CCT GATGACATT GAT 
C CTACACCTACTC CTACT C CTACT CCTACTC CT GATAAAAGT CAT AATT CT GGAGT TAATATTT CTACGCTGGTATTG 
T CT GT GATT GGGT CTGTT GTAATT GT TAACTTTATTTT AAGTACCACCATT TGAA CCTTAACGAAGAAAAAAAT CTTC 
AAGTAGAC CTAGAAGAGAGTTTTAAAAAACAAAACAAT GTAAGTAAAGGAT ATT T CTGAAT CTTAAAATT CAT CCCAT 
GTGT GATCATAAACT CATAAAAATAATT TTAAGATGTCGGAAAAGGATACTTT GAT TAAATAAAAACACT CAT GGATA 
T GTAAAAACT GT CAAGATTAAAATT TAATAGTTT CATTTATT T GTT AT TTTAT TT GTAAGAAATAGT GAT GAACAAAG 
AT C CTTTTT CATACTGATAC CT GGT T GTAT ATTATT TGATGCAACAGT TTT CT GAAAT GATATT T CAAATT GCAT CAA 
GAAATTAAAAT CAT CTAT CT GAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFWSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGVYSRYFTAYTEMGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS I LDLRDS FDDALQVNTTDLSPKEANSKES FAFKPENI SEENATHI FIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 
amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



:- T. 



CT C CT T AGGT G GAAACCCT GGGAGTAGAGTACT GACAGCAAAGACC GGGAAAGAC CATACGTC CCC GGGCAGGGGTGA 
CAACAGGT GT CAT CT TTT T GATCT C GTGT GT GGCT GCCT TC CTAT TTCAAGGAAAGAC GC CAAGGTAATTTT GAC CCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAA.CTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCT GT GAT G GGAAT GAAGGT GAAAACTT GGAGAT TTCACTTCAGT CAT T GCTT CT GCCT GCAAGAT CAT CCT TTAAA 
AGTAGAGAAGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGA ATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
C GCAACTACGT GAG CAGCCT GAAG C GGCAGAT CGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
AAT GGG CAGTACCAAGCCAGC GAT GCT GCT GGCCT GGGT CT GGACAGGAGC CC CCCAGAGAAAAC CCAGGCCGACCTC 
CTGGCCTT CCT GCACT CGCAGGT G GACAAGGCAGAGGT GAATGCT GGC GT CAAGCT GGCCACAGAGTAT GCAGCAGT G 
lj CCTT T C GAT AGCTTT ACT CTACAGAAGGT GTACCAGCT GGAGACT GGC CT TAC CCG CCAC CCCGAGGAGAAGCCT GTG 

AGGAAG GACAAGC GGGAT GAGTT G GT GGAAGCCAT T GAAT CAGCCT T GGAGAC CCT GAACAAT CCT GCAGAGAACAGC 
M CCCAATCACC GT CC T T ACACGGCCT CTGAT TT CATAGAAGGGAT CTAC C GAACAGAAAGGGACAAAGGGACAT TGTAT 

GAGCT C^CCTT CAAAGGGGACCACAAACACGAATT CAAACG GCT CAT CTT ATTT C GAC CAT TCAG C CC CAT CAT GAAA 
: : : GTGAAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGAa\AGTTC 

C GGCAGTT CAT GC AGAAT TT CAGG GAGAT GT GCATT GAGCAGGATGGGAGAGT CCATCT CACT GTT GTTTACT TT GGG 
i„fl AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GAAAACACTT CCAAAGCT GC CAACTT CAGGAAC TTTAC CT T CATC 

;-|*J CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 

TTTTT CTGT GAT GT GGACATCTACTT CACAT CT GAATT CCT CAATACGT GT AGGCT GAATACACAGCCAG G GAAGAAG 
:j GTAT TTTAT C CAGT T CTT TTCAGT CAGTACAATC CT GGCATAATATACGGCCACCATGATGCAGTCCCT CC CTT GGAA 

CAGCAG CT GGT CATAAAGAAGGAAACTG GATTTT GGAGAGACTTT GGATT T GGGAT GACGT GT CAGTAT C GGT CAGAC 
TTCAT CAATAT AGGT GGGTTT GAT CT GGACATCAAAGGCT G GGGCGGAGAGGATGT GCACCTTTATCGCAAGTAT CT C 
CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACG^ 
S CTGACC CCCGAGC AGTACAAGAT GT GCAT GCAGT CCAAGGCCAT GAACGAGGCAT C CC AC GGCCAGCT GGG CAT G CT G 

H GTGTTCAGGC^CGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAA^ 

*f GAAGGATT GT GGGAGACACTT TTT CTTT C CTTTT GCAAT TACT GAAAGT GGCT GCAAC AGAGAAAAGACTT CCATAAA 

H GGACGACAAAAGAATT GGACT GAT GGGT CAGAGATGAGAAAGC CT CCGAT TTCTCT CT GTT GG GCT TTTTACAACAGA 

J t AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGAC^\AAGGCAGAATGCTTGTG 
^ AGATTATAAGCCTAAT GGT GT GGAGGTTTTGAT GGT GTT TACAAT ACACT GAGACCTGTT GTT TT GT GT GCTCATTGA 

\\ AATATT CAT GATTTAAGAGCAGTTT T GTAAAAAAT T CATTAGCATGAAAGGCAAGCATATT T CT CCTCATATGAATGA 

M GCCTAT CAGCAGG GCT CTAGT TT CTAGGAAT GCTAAAAT AT CAGAAGGCAGGAGAG GAGATAGG CTTATTATGATACT 

AGT GAGTACAT TAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATAT CGT GT CATATTT T C CC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTT TTT CCCTT GT GAGTTATAGTCT GCT TATTT AATTACCACT T T GCAAGC CTTACAAGAGAGCACAAGT T GGC CTAC 
ATT T TTATATT TTT TAAGAAGATACT TTGAGATGCATTAT GAGAACTTTCAGTT CAAAGCATCAAATT GAT GCCATAT 
C CAAGGACAT GCCAAATGCT GATT CT GT CAGGCACT GAAT GTCAGGCATT GAGACATAGGGAAGGAATGGT TT GTACT 
AATACAGACGTACAGATACTT T CT CT GAAGAGTAT TTTCGAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACTT TCTGCTT TACAGAAAAGGAAACTCAT T CAGACTG GT GATAT C GT GAT GTACCTAAAAGT CAGAAAC CACATTTT 
CTC CT CAGAAGTAGGGACCGCTT T CT TAC CT GTTTAAATAAAC CAAAGTATACC GT GT GAACCAAACAAT CTCTT T TC 
AAAACAGGGT GCT CCTCCTGGCTTCT GGCT T CCATAAGAAGAAAT GGAGAAAAATATATATATATATATATAT ATTGT 
GAAAGATCAAT CCAT CTGCCAGAAT CTAGT GGGAT GGAAGTTT TT G CTACATGT TATC CAC CCCAGGCCAGGT GGAAG 
TAACT GAATTATTTT T TAAATTAAGCAGTT CTACT CAAT CACCAAGAT GCT TCT GAAAATT GCATT TTATTACCATTT 
CAAACTAT T TT TTAAAAATAAATACAGTTAACATAGAGT GGTT TCTTCATT CAT GT GAAAATTATTAGC CAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGT GGTATT GCACAGCTAATAAAATAT GATTT GT GGATATGAA 
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a 
m 



fy 



MMMVT^GLLAWISRVWLLVLLCCAISVLTO 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGVKIATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVK3STEKLNMAN 
TL INVI VPLAKRVDKFRQFMQNFREMC IEQDGRVHLTWYFGKEE INEVKGI LENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVWTPWGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHE^ 
RKQKQKT S SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 



APP ID=10063557 



Page 222 of 3 18 



FIGURE 73 



ns 1 



if! 



GAGACT GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT TTGTCCTGGGGATCCA 
GAAACCCAT GATACCCTAC T GAACACC GAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACT ATG TTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
Q ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
5fl GCCCAGCT CCACC TGCACT GGGGT CAGAAAGGATC CCCAGGGGGGTCAGAACACCAGAT CAACAG 

W TGAAGCCACATTTGCAGAGCTCCACATTGTACAT TAT GACT CTGAT TCCTATGACAGCT TGAGTG 

AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TATGAACACATTCTGAGTCACTTGCATGAAGT CAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 

Q GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 

! fl 

Mi AT T TCAATGGAACAGCT GGAAAAGCT T CAGGGGACAT T GT T CTCCACAGAAGAGGAGCCCT CTAA 

2j1 GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

i y 

P«| TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
f-y TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

CCGAAAGAGT GTGGT CT TCACCTCAGCACAAGCCACGACTGAGGCATAAATT CC T T CTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACT GTAGGAGT AGTAAGCAGAT GTCC TCCT T CCCC TGGACATCT CT TAGAGAGGAAT 
GGACCCAGGCT GT CAT T CCAGGAAGAAC T GCAGAGCC T TCAGCC T C TC CAAACATGTAGGAGGAA 
ATGAGGAAATC GC TG T GT T GT TAATGCAGAGANCAAAC T C T GT T TAGT T GCAGGGGAAGT T T GGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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3 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTT TGCC T AAACC T GC AAAC AT CACC T T C T TATCCAT CAAC AT GAAGAATGT CC TACA 
AT GGACTC CACCAGAGGGTC TT CAAGGAGT T AAAGT TAC T TACACTGT GCAGTAT T TCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGT GGAAGAGAAATCCAGAAGACCT TCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGT C TGTGT T GAAT ACTAAAT CAAACAGAACGTGG T CCCAGT G TG T GACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
T CTGTGATGGGCTAT TCCATCTACCGATATATCCACGT TGGCAAAGAGAAACACCCAGCAAAT TT 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
ACTTTATCACCCTCAATATCTCGGAT GATTCTAAAATT TC TCATCAGGATAT GAGT TTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGT CAT TGAAT ATGAATATGAT GT CAGAACCAC T GACAT T TGTGCGGGGCC TGAAGAGCAGGA 
GCTCAGTT T GCAGGAGGAGGTGTCCACACAAGGAACAT TAT T GGAGT CGCAGGCAGCGT TGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGG AGCACACAGAC T C GGAGGAGG G G C C G GAGGAAGAGC CAT C GAC GACC C T GG T CGAC T GGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCC T T CT GAGGGGGATGGGCT CGGAGAGGAGGGTC T TC TAT C T AGACT CTAT GAGGAGCCGGCT 
CCAGACAGGCCAC CAGGAGAAAAT GAAACCTAT CTCATGCAATT CAT GGAGGAAT GGGGGT TAT A 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYOTSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSE FKAKI I FWYVLP I S I TVFLFSVMGYS I YRYIHVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC T GAAGGAC CACAAC GCCACCAGCATCCT GCAGCAGCT GCCGCT GC TCAGT GCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T GAT CAAT GAAAAGGC TGCAGATAAGCTGGGATC TACCCAGATCGT GAAGAT CC TAAC T CAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T TACACCAAAGGTGACCAACT TATACTCAAC T TGAATAACAT CAGCT CT GATCGGATCCAGCTG 
ATGAAC T C TGGGAT TGGC TGGT TCCAACCT GAT GT TC TGAAAAACAT CAT CACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGG7kAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGSLVNTVLKHI I WLKVI TANILQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDC7VTSHGSLRIQLLYKLSFLWAIAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNS AAS LTMPTLDNI PFS L I VS QDVVKAAVAAVL S PEE FMVLLDS VLPESAHRLKS S I GL IN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKYAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLI LNLNNI S SDRIQLMNSGI GWFQPDVLKNI ITE I IHS I LLPNQNGKLRS GVPVSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 




Important features of the protein: 
Signal peptide: 



amino acids 1-21 



N-glycosylation sites. 



amino acids 48-51, 264-267, 401-404 



Glycosaminoglycan attachment site. 



amino acids 412-415 



LBP / BPI / CETP family proteins. 



amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGTCAGCCTGGCAGAGAGAC TCTGAAATGAGGGATTAGAGGTGT T CAAGGAGCAAGAGC 
T TCAGCCTGAAGACAAGGGAGCAGT C CCTGAAGACGC T TC TACT GAGAGGT CTGCCATGGCC TCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CC AAGGGC C TC TGGATGGAATGT GCCACACACAGCACAGGCAT CACCCAGTG TGACAT C TATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGAC T CAGAGGATC CC TT TGCCC TCT GGT T TAC CTGGGAC TCCATCCCCAAACCCAC TAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CC TCCAAAGAAAC TGATTGGCCCT GGAACCT CCATCCCAC TCTT GT TAT GACTCCACAGTGT CCA 
GACTAATT T GTGCAT GAAC T GAAATAAAACCAT C C TAC GG TAT C CAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCC T GGGACAT T TAAAAAAAT A 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I Y S T LLGL PAD I QAAQAMMVT S S AI S S LAC 1 1 S WGMRC TVFCQE S RAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
% ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
M GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
;11 TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
IM AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
Ml ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

in 

ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
Q AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
111 CAGACCCAAGGGAGCAGGCAGCACAAC T GT CAAGATCGTCC T GAAGGAGAAACATAAGAAAGCCT 

n 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
'M CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
m CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
C T CG T CCACACAT CGG TAT CCCCAAGCCCAGACAACC T GCGTCGCTTTGCCCT GGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GT GAAGTACC TGGCCCAAGGCCACACAGCCAGAATCT T CCACTT GAC T CAGATCAAGAAAGTCAG 
GAAGCAAGACTT CCAGAAAGAGGCACAGCAC TT CCGACTGC TCGC T GGCCC CCAC GAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACCTAACAGTTGCAGATATGAGCTGTATAATTGTTGT TAT TATATATTAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



\ n 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8s 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTC CATGCC GT GAGGTCCATT CACAGAAC AC ATCC ATGGCT CT CATGCT C AGT TT GGT T CT GAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGT T CT CT AGCGT GGT C C ACCTCT AC AGGG AC GGGAAGGACC AGCC ATTT AT GC AGAT GCCACAGT ATC 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AG GT GT CAGC ACT GGG CTCAGT T CCT CTC AT TTC CAT C ACGGGATAT GTT GAT AGAGACAT C CAGCT AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AG AC TCCAGGAC AAACAGAGAC ATGCAT GGCCTGT TTG ATGT GGAG AT CT CT CT GACC GT CC AAGAGAA 
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGC ACG GACAGGCAGAAT TGAGAG ACGC CCGGAAAC ACGC AGTGGAGGT GAC TC TGGATC C AG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTCACTCTGAGAAGAGAT T T ACAAGGAAG AGTGTGGTGGCT T CT C AGAGTT TCCAAGCAGGGAAACATT A 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AG GAGT ACGT GACTTT GTCT CCCGATCATGGGT ACT GGGT CCT C AG ACTGAATGGAGAAC AT TT GT AT TT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TAT T GAGGCC CT AC AT TGAGTATCCGT C CT AT AAT GAGCAAAATGG AACT CCCATAGTCATCTGC 
CCAGTC ACCC AGGAAT CAG AGAAAGAGGCC TCT T GGCAAAGGGCCT CT GC AAT C CCAG AG ACAAGC AACAG 
TGAGTC CTCC TC AC AGGC AACCACGCCC TTCCT C CC C AGGGGTGAAAT G TAGGATGAAT C ACATCC CACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGAC ATT AC AT TT AGTTTGCT CT C ACT CC AT CT GGCT AAGT GATCT T GAAAT ACC ACCT CT CAGGTGAAG 
AACC GT CAGGAATTC C CAT C TC AC AGGCTGT G GTGT AG AT T AAGT AGAC AAGGAATGT GAAT AATGCT TAG 
ATCT TATT GATG AC AG AGTGTATC C T AATGGT TT GT TCAT TAT AT T AC AC TT TC AGT AAAAAAA 
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FIGURE 84 

MAMLSLVLSLLKLGSGQWQVFGPDKPVQ7VLVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPL I S I TGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAC ATG CTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATAGCAAGAAT TGCACCC TGAGCAT CAGAGAT GCCAGAAGAAG TGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
M fl TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

... .<g 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
jjj GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
Ml ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

m 

: hM GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
jjl CT GAGGGAT GCAGCT GAAT TCACC TGCAGAGCTCAGAACCC TC TCGGCT CT CAGCAGGT CTACCT 

K| GAACGTCTCCCTGCAGAGCAAAGCCACAT CAGGAGT GAC TCAGGGGGT GGT C GGGGGAGCT GGAG 

| !; Jf CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
!-!f! TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGATT C T TGTAGAAT TAACAGCCC T CAACGTGATGAGC T AT GAT AACAC TAT GAAT TATG 
T GCAGAGT GAAAAGCACACAGGCT T T AGAGTCAAAG TATC TCAAACCT GAAT CCACAC TGTGCCC 
T CCC T T T T AT TT T T T TAAC TAAAAGACAGACAAATTCC TA 
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FIGURE 86 

MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPAKAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCAT AGC GACCAC CAGAGGAT GGAG TACAGAT GAGGC TAAT ACTT ACT T CAAGGAATGGAC C T GT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
;U CAACTACAACAC CT T T GGAT C T GCAGAGGCGGC CACGAGCGAT GACTACAAGAACCCTGGCTACT 

Q ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 

AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 

PI 

: ; 2 G T T T G G CAT C T AC CAGAAAT AT C C AG T GAAATAT G GAGAAGGAAAG T G T T G GAC T GACAACGGCC 

MJ 

yf CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 

tjl GGCCAGCGGGAAT TCAC TGCGGGATT T GTT CAGT TCAGGGTATT TAATAACGAGAGAGCAGCCAA 

■M CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

* GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

j'S GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 

Q T TGAG AGT T T TGT GGGAGGGAACCCAGACCTCTCCTCCCAACCAT GAGATCCCAAGGAT GGAGAA 

fU CAACT TACCCAGTAGC TAGAATGT TAAT GGCAGAAGAGAAAACAATAAATCATAT TGAC T CAAGA 

W AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEAS PQQCGDFSGFDWSGYGTHVGYS S SRE I TEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8o 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATGG GCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGACT TATATACAGT T T TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 

CT GGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAAC TGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCC T CAGC CAAGGTGAAGAAAATCT T TGGCT GGGGCGAC T TC TACT CCAACAT CAAGACGGT G 
GC CCTGAAC CTGC TCGT CACAGG GAAGAT T GTGGACCAT GGCAAT GGGACCT T CAGCGTCCAC T T 
CCAACACAAT GCCACAGGCCAGGGAAACAT C TCCATCAGCC TCGT GCCCCCCAGTAAAGC TGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGT GGGAGAAGG TAGAAC GGGGCCGCCGGACCT CGC T T TGCACCCACGACC CAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
T CTACATC GCC T T CTACAGCACGGACTATCGGC T GGTCCAGAAGGT GT GCCCAGAT TACAAC TAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCAT CC T TAAGCTAAGACAGGACGAT T GTGGT CC TCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
S VH FQHNAT GQGN ISIS L VP P S KAVE FHQEQQ I F I EAKAS K I FNCRME WE KVERGRRT S L C THDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyltransferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGAT GGACCAACACAGAAATAT CT GCT GAT CT T T GGAGCG T TT GT CT CTG TC TATATCCAAGAA 
AT GTT CCGAT T TGCATAT TAT AAACT C T T AAAAAAAGCCAG T GAAGGT T TGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAG CC T GAAACT C TGCC TGC TCT GCCAAGACAAGAAC TTTCTTCTT TACAACCAGCGC TCCAGAT 
AACCT CAGGGAACCAGCAC T TCC CAAACCGCAGAC TACAT C T TTAGAGGAAGCACAAC TGTGCC T 
T T T TC T GAAAAT CCC T T T T TCT GGTGGAAT T GAGAAAGAAATAAAAC TATGCAGATA 
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FIGURE qa 

MTAAVFFGCAFIAFGPALTUiYVFTIAIEPLRIIFLIAGAFFWLVSLLISSLWFMARVIIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLPCKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site, 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTT TCCTTCT T TAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTT7U\ATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCC TGGGAGGGT TGAAT GTACAACAGCAACT GCAC CCACATGTGT TAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
Q TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 

GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 

m 

G C AG GCAT C CAAAGGAG CACACAT GC CAT C GAGGAAG C CAC CACAGAAT C AG CAAAT G GAAT T C A 

GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ml 

i|| AT T GAT TAT AT T ATGGAAT AGAT T GAGACACAT T G GAT AG T C T T AGAAGAAAT TAAT T C T TAAT T 

Hji TACCTGAAAATATTCT TGAAATTTCAGAAAATATGTTC TATGTAGAGAATCCCAACTTT TAAAAA 

^ CAAT AAT T CAAT GGAT AAAT CTGTCTTT GAAATATAACAT TAT GCTGCCTG GAT GAT AT GCAT AT 

k t T AAAAC AT AT T T GGAAAAC T G G AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Ml 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 



Signal peptide: 

amino acids 1-16 

y. 
m 

W 

m 



■ *? 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACC ATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGT CACCC T GGAGAT GCAGGAT GGACAT GT CT T GT GGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
M ? AC CAGCC T T GACCACGCCC T GGAGAGGAC CTACCAGC T GT TGGTACAGGTCAAGGACATGGGT GA 

Q CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
Q CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
m CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
! J TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
M<J AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
U1 ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
yi CAGT CCACCAGGTAC TGAAGTGAC TAGACTGTCAGCAGAGGAT GCAGAT GCC CCCGGC TCC CCCA 

■Zi ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
' % CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
y T CGAAGTCGCAGTCACAGATAT CAATGAT CACGCCCC TGAG TTCATCAC T T CCCAGAT TGGGCCT 

Ul ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
f*| CCTCGAGCCCGCCTTCCGCC TCAT GGATTT TGCCATTGAGAGGGGAGACACAGAAGGGACTTT TG 

;i;fs GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
Q AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
j-fj ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTT CACCC TTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACC CAC TGGACCAT GT CAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGT GCCCCT 
GAAGGCGACTGTC TGAA TGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCC C CT GGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAG TAGAAGCCCC TCCA 
T CT GCCCT GGGG TGGAGGCACCAT CAC CAT CACCAGGCATGT CT GCAGAGCC TGGACAC CAAC TT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS 1 1 E S TWVS LE P I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVIAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVWSHNAQMWQLLTOVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 



Mil 

\n LI FTHWTMSRKKDPDQPADSVPLKATV 



Signal peptide: 

amino acids 1-18 



Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATT C T TCCCAGGAACACAAACGTAGGAGACCCACGC TCCT GGAAGCAC CAGCC T T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
T T CT CC TTATGT T TGGT C TACTAT TGCAT T TAGAAGC T GCAACAAAT T CCAATGAGAC TAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
G CAT AG T CAC C AAC T C T GAG T T C C AT AC AAC C T C CAG T G GG AT C AG C AC AG C CAC C AAC T C T GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCT GAGT CCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
= . CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
Q CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
iJJ CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 

GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
i t \ AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
; f 2 GCACAGCCACCAAC T C TGAC TCCAGCACAGT GTCCAGTGGGGCCAGCAC TGCCAC CAAC TC TGAG 

y) TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
Uf| GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
Sj CTGAGT CCAGCACAGT GTCCAGTGGGAT CAGCACAGTCACCAAT T CT GAGT CCAGCACACCCT CC 

AG T G G G GC CAACAC AGC CAC CAAC T C T GAG T CCAG T AC GACC T C CAG T G GG GCCAAC AC AGC CAC 
;L CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
y CCTCCAGTGGGG TCAGCACAGCCACCAAC T C TGAGT CCAGCACAACC TCCAGT GGGGC TAGCACA 

L.f1 GCCACCAACTC T GAC T CCAGCACAAC C T CCAGTGAGGCCAGCACAGCCACCAAC T C TGAGTC TAG 

Q CACAGTGT CCAGTGGGATCAGCACAGTCACCAAT TCTGAGTCCAGCACAACCTCCAGTGGGGCCA 

ajj ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
;4 ATGCACACAACT T CCCATAGTGCATC TAC T GCAGTGAGTGAGGCAAAGC CT GG TGGGTCCCT GGT 

^ GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
ftl TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAG TCCT AAC TGGTTCTGGAGGAGACCAGTATCAT CGATAGC CAT GGAGAT GAGC GG GAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATC TT GAAGAAGGTAT T CC TCACCT TTCT TGCC TT TACCAGACACT GGAAAGAGAATACTATAT 
T GCT CAT T TAGC T AAGAAATAAAT ACAT CT CAT C TAACACACACGACAAAGAGAAGC T GT GCT T G 
CCCCGGGGTGGG TATC TAGC TC T GAGAT GAACT CAGT TATAGGAGAAAACC T CCAT GC T GGAC TC 
CATC T GGCAT T CAAAATC T CCACAGT AAAAT CCAAAGACC TCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRAS TATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSAS TAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 



& AMEMSGRNSGP 

m 



:|« Signal peptide: 

,j1 amino acids 1-20 



-J Transmembrane domain: 

! !: 2j amino acids 510-532 

U! 

a** 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGT G T CAT GGAAAGGGAT T TAC T T TATAC T GACTCT GT T T TGGGGAAGCT T TT T TGGAAGCAT TT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
y s CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
Q GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 
p GAAT GGAC TGGATGT TCCT GTGGAAT T GCCTGATGCGATATAGCTACC TCAGAT TGGAGAAAAT T 

yl TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
^ C T TCAT T CATAGGAAATGGAAGGAT GACAAGAGCCAT TT CGAAGACAT GAT TGAT TAC T T T TGT G 

= p ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
\jj TCTCGAAGT AAT GCAT T TGCTGAAAAAAATGGAC T T CAGAAATATGAATAT GTT T TACATCCAAG 

AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATAT CACT GTGGCGTAT CC TCACAACAT T CC TCAAT CAGAGAAGCACC TCCT CCAAGGAGAC TT T 
m CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
fj] CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
C3 GGGAGAAGAAT TTT TATT TTACCGGACAGAGTGTCAT TCCACCT TGCAAGTCTGAACTCAGGGTC 

^ CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATT TGGTGGACT GGAGAT CATAGAAC TTGCATGTTACCGACT T T TACACAAACAGCCACAT 
T T AAAT T CAAAGAAAAAT GAG TAA GAT TAT AAGG T T T GC CAT G T GAAAACC T AGAGCATAT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCAT CAT TAT T TGT T AAAGATAT TT TGCACT TAAT T T TG TGGGAAAAATAT TGCTACAAT T TT T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGC CAGAATAT TAT TAAACAAT CAT CAGGCT T T TAAA 
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FIGURE 102 

MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPmFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 102 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CTACAGAGATGGGGAAGACT GGGAAT C TAAGCAGATGCCACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
3 GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
J;J TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
f* AAAT CCAAAGGGAAAATCCAGGCGGAACTGGAC T GGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACGCGAAGCTCTGCG 
T T TC T GAT CT GAAAACTGTAACCCATAGAAAAGC TCCC CAGGAGGTGC CTCAC TC TGAGAAGAGA 
M TT TACAAGGAAGAGTGTGG T GGC TTCT CAGGGT T TCCAAGCAGGGAGACAT TAC T GGGAGGT GGA 

:; ff CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
L GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
3 TGACATGT CAGT T TGAAGGC TTGTT GAGACCCTATAT CCAGCATGCGAT GTATGACGAGGAAAAG 

jj GGGAC T CCCATAT TCATAT GTCCAGT GTCCT GGGGATGAGACAGAGAAGACC CT GC T TAAAGGGC 

CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
C T CCAT C CAGC T AAGC GAT C T T GAA.CAAG T CAC AACC T CC CAGGC T C C T CAT T T GC TAG T C AC GG 
ACAGTGAT T CCT GCC TCACAGGT GAAGAT TAAAGAGACAACGAAT GT GAAT CATGC T TGCAGGT T 
TGAGGGCACAG T G TT TGCTAATGATGT GT T T TTATAT TATACAT T T T CCCACCATAAAC TC TGT T 
T GCT TAT T CCACATTAAT T TACT TT T C TC TATAC CAAATCACCCATGGAATAGT TATTGAACACC 
TGCT TT GTGAGGC TCAAAGAATAAAGAGGAGGTAGGATTTT TCACTGATTCTATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT TAACACAGACACAAAAAT TCTAAATAAAAT T T TAACAAAT TAAACTAAACAATATAT T TA 
AAGATGATATATAACTACTCAGTGTGGTT TGTCCCACAAATGCAGAGT TGGT TTAATATT TAAAT 
AT CAAC CAGTGTAAT TCAGCACATTAATAAAGTAAAAAAGAAAACCAT AAAAAAAAAAftAAAA 
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FIGURE 104 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMSVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 1Q5 

CCT T CACAGGACT CTT CAT T GCTGGT TGGC AATGAT GT AT CGGC C AGATG TGGT GAGGGC TAGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACT GT TC AT T AT GTGAGAT ATAATC AAAAGAAG AC CT ACAATT AC T ATAGC AC AT T GTC ATTT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
C AAT GGT G AAAAATGC ATTTTAT AAATCTCCAT T AAGGGAAGAATT TGT C AAGT CT CAGGTT ATC AAGT TC 
AGT C AAC AGAAGCAT GGAGTGT TG GCTC AT ATGCT GTT GATT TGTAGATT TCAC TCT ACT GAGGAT CCT G A 
AACT GT AGAT AAAAT T GT T CAACT TGTT TT AC ATGAAAAGCT GC AAGAT GCT GT AGGACCCCC TAAAGT AG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
Q GC CCTGGCAGGCTAGCCTGCAGTGGG AT GGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCC AC AT GGC 

h-J T T GT GAGTGC TGCT CACT GTTT TACAACAT AT AAGAACCCT GCC AGAT GG ACT GCT TCCT TT GGAGT AAC A 

?'.? ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
?Z TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
j"p TCCCTGATGC AT CCT ATG AGTT TC AACCAGGTGATGTGAT GT T T GT GAC AGGAT TTGG AGCACTGAAAAAT 

%l GATGGT T AC AGT CAAAAT C ATCTT CGACAAGC AC AG GTGACT CT C ATAGACGCT ACAACT TGC AAT GAACC 

TC AAGCTT AC AATGAC GCCAT AACT CCT AGAAT GTT ATGT GCT GGCTCCT TAGAAGGAAAAAC AGATGC AT 
f|1 GCCAGGGTGACT CTGGAGGACCACTGGT TAGTT C AGAT GC T AG AGATATCTGGT ACCTT GCT GG AAT AGTG 

y*| AGCT GGGG AG AT GAAT GT GCGAAACCC AACAAGCCT GGTGTT T AT ACT AGAGTT ACGGCC TT GCGGGACTG 

Q GATTACTTCAAAAACTGGTATC TAA GAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
He! GGTGTGGAGGCC ATTT TT AG AGAT ACAG AATTGGAG AAGACT T GCAAAAC AGCT AGAT TT GACT GATC TC A 

M ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

fit 

; GAT C AACTCTGT CATC TGT G AGCAAT AG TT GAAACT TT AT GT AC AT AGAGAAAT AG AT AAT ACAAT AT TAC 

AT T ACAGCCTGT AT TC AT T TGT TCTCT AGAAGTT TT GT CAGAAT TT TGAC TT GT TGAC AT AAATTT GT AAT 
GC AT AT AT AC AATTTGAAGC ACTCCTT T TCTTCAGT TCCT C AGC TCCT CT CAT T TCAGCAAATATCCATTT 
T CAAGGTGCAGAACAAGGAGT GAAAGAAAAT AT AAGAAGAAAAAAATCCCCT AC ATTTT ATT GG CACAGAA 
AAGT AT T AGGTGTTTTTCT T AGTGGAAT ATT AG AAATGAT CAT AT T CATT AT GAAAGGT C AAGCAAAG AC A 
GC AGAAT ACCAATCACTT C ATCAT TT AG GAAGT ATGGGAACT AAGT TAAGGAAGT C CAGAAAGAAGCC AAG 
AT AT AT CCT T AT TTTC AT T TCCAAAC AACT ACT AT G AT AAATGT GAAGAAGATTCT GT T T TT TT GT GACCT 
AT AAT AATT AT ACAAACT T C ATGC AAT G T ACTTGTT CT AAGC AAAT TAAAGC AAAT ATT T AT TT AACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 106 



m 



MMYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 

AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHM 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 

TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYfvNFPARWTAS FGVTIKPSKMKRGL 

RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYE FQPGDVMFVTGFGALKNDGYS 

QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 

IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 



;5 
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FIGURE 1Q7 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT T C TCACT T TAAC T GCAGAAACGAGGGTAGAGGAAGCAGTCAT T T TGAC TTAC T T T CC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACT TTGAAAGCCAGGATGACAAAT TATGGATTACCTAGATAT CGGTGGCT TACTCA 
T GCT T GGAAT x T TT T T CAGAGAGAGTT TAAGTGC TGTGGAGTAGTATAT T T CAC TGACTGG T TGG 
AAAT GACAGAGATGGACTGGCCCCCAGAT TCCTGCTGTGTTAGAGAATTCCCAGGATGT TCCAAA 
CAGGCCCACC AGGAAGATC TCAGTGACCT T TATCAAGAGGGTT GT GGGAAGAAAATGTATT CC T T 
;3 TTTGAGAGGAACCAAACAACTGCAGGT GC T GAGGT T T CT GGGAAT CT CCAT T GGGGTGACACAAA 

;*fj TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
I ACAGACCAAATGATGTCC T TGAAGAAT GACAAC T C T CAGCACC T GTCATGTCCC T CAG TAGAACT 

l ff GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
Ml TT GAGATGGAGGAGT TATAAAAAGAAATGT CACAGAAGAAAACCACAAAC T T GT T T TAT TGGACT 

U1 T GTGAAT T T T TGAGTACATACTATGT GTT T CAGAAATATGTAGAAATAAAAATGT TGCCATAAAA 

%J TAACACC T AAGCATATAC TATT CTAT GCT T TAAAATGAGGATGGAAAAG T T T CAT GT CATAAGT C 

ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
1 ; . TGTGT AT GACT T T TAC TGAACACAGT TAT GT T T TGAGGCAGCAT GGT T TGAT TAGCAT T TCCGCA 

Q T CCAT GCAAACGAGT CACATATGGT GGGAC T GGAGCCATAGTAAAGGT TGAT TTAC TT CTACCAA 

yl C TAGT AT AT AAAGTAC TAAT TAAATGCTAACATAGGAAGT TAGAAAAT ACTAATAACT TT TAT TA 

?*% CTCAGCGAT CTAT TC T TC TGAT GC TAAAT AAAT TATATATCAGAAAACTT T CAATAT T GGTGACT 

j!n ACC TAAAT GTGAT T T T TGC TGGT TAC TAAAATATTC T TACCACT TAAAAGAGCAAGC TAACACAT 

TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
Q T T CAG T T C T GAT AAT G T T AAGAAT AAC CAT TAT GAAAAGG AAAAT TTGTCCTG T AT AGCAT CAT T 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGTTATTTAAATACTTAACCACTAATTTTGAAAATTACCAGTGTGATACATAGGAATCATTATTC 
AGAAT GT AGTC TGGT C T T TAGGAAGTAT TAATAAGAAAAT TTGCACATAAC T TAGT TGATT CAGA 
AAGGACT T GTATGCTGTT T T TC TCCCAAAT GAAGAC T C TT T TT GACACTAAACAC T T T T TAAAAA 
GCTTATCT T TGCCTT C TCCAAACAAGAAGCAATAGTCTCCAAGT CAATAT AAAT T C TACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
C T TT GT TT TAT T T CAC TGAT TAATATACTGT GGCAAAT TACACAGAT TAT TAAAT T TT T T TACAA 
GAGT AT AG T ATAT T TAT T T GAAAT GGGAAAAG T GC AT T T TAC T G TAT T T T G T G TAT T T T G T T TAT 
T T CT CAGAATATGGAAAGAAAAT TAAAATGT GT CAATAAATAT T T TCTAGAGAGTAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLM^ 

VMIAVCCFLI I VGMLGYCGTVKRNLLLLAWYFGSLLVI FCVEIACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQRE FKCCGWYFTDWLEMTEMDWPPDS CCVRE FPGCSKQAHQ 
EDLSDLYQEGCGKKMYS FLRGTKQLQVLRFLGI S IGVTQI LAMI LT I T LLWALY YDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE lOQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CTT T CT C GT GAGAGC C T AGAGGC C T T AAAAAAAAAAGT GCTT GAAAGAGAAGGG GAC AAAG GAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGG ATG CCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCC GT GG CCCT AGATCCTT GTTCT GCTTACAT CAGCCTGAATGAGC 
CCT GGAGGAAC ACT GACCACCAGTT GGAT GAGTCT CAAGGTCCTCCT CT AT GTGACAACCATGTGAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGAC GGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
C AGCT GCTCAGAT AC C AGCGAGT GCACATGC GCTC CAGGAACT GTGCTAGGCCCT GACAGGCAGAC AT GCTTTG 
ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
GAGT GT G GGGT T GGC CGT GT GCT AAGAAGT GAT GGC AAGACT T GT GAAGAC GT T GAAGGAT GCC AC AAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT G AGGAT AAC C AC ACT T GCC AA GT CC C TGT GT T GT GC AAAT C AAAT GCC AT T GAA GT GAAC AT C CCC AGGG AG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCT CTTCTCTCT CAAGAC AT GT G GT AC AGT GGT CG AT GT GGT GAAT GAC AAG AT TGT GGC C AGC AACC T C GT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCT GC GAGT T TCCAC GCCT GT AC ACC ATT T CT GAAGGAT AC GT T C C C AACC T T C GAAAC T C CCC ACT GGAAAT 
CAT GAGC C GAAATCAT GGGAT CT TC CC ATT C ACT C T GGAGAT C T TC AAGGAC AAT GAGTT T GAAGAGC C T T AC C 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTT GGTGGAGAGCT GCTTT GCCACCCCCACCTCCAAGAT CGACGAGGTCCT GAAAT ACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT CAG AAGC T GGGT AT AAT AT T T C AAGT T AC AAACCCT AGAAAAAT T AAACAGT TACT GAAAT TAT GA 
CTT AAAT ACCC AAT GACTCCTT AAATATGT AAATT ATAGTT AT ACCTTGAAATTT C AATT CAAAT GCAGACT AA 
T T ATAGG GAAT T TGGAAGT GTAT C AAT AAAACAGT AT AT AATTTT 
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FIGURE no 



: :>:: 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDBMP 
TFCIPENHCGTHAPWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CE FPRL YT I SEGYVPNLRNS PLEIMSRNHGI FPFTLEI FKDNE FEEPY RE ALPTLKLRDSLY FGIE PWHV 
SGLE SLVESCFATPTSKI DE VLKY YLI RDGCVSDDS VKQYTS RDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 



Tyrosine kinase phosphorylation sites. 

Ms amino acids 411-418, 443-451 



N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



m 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCT CCTCCAGCCAGT GCT G ACCAGGGACT T C T G AC CT GCT GGCCAGCCAGGAC CT GT GT GGGGAGGC CCT 
CCT GCT GCCT TGGGGT GACAAT CT CAGCTCCAGG CT AC AGGGAGACCGGGAGGATC AC AGAGCC AGC ATG T 
T AC AGG AT CCTGAC AGTGAT CAAC CT CT GAAC AGC CTCGATGT C AAAC CC CTGC GC AAACCC C GTATC CCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GTGT G ACGGAGAGCT GGACT GT CCCT T G GGGGAGGACGAGGAGC AC T GTGTC AAGAGC TTC 
CCCGAAGGGC CT GCAGTGGC AGTCCGCC TCTCC AAGGACC GATC C ACACT GC AGGT GCT GGACTCGGC CAC 
AGGGAACTGGTT CT CT GCCTGT TT CG AC AACTTC AC AG AAGCTCT C GCTG AG AC AGCCTGTAGGC AGATGG 
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
GCCC AC TGCT TC AGGAAACAT ACC GATGT GTT C AACTGGAAGGT GC GG GC AGGCTC AGAC AAACT G GGCAG 
CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATG AGGAGC TCACTCCAGCCACCCC ACTC TGG ATC ATTGGAT GGGGCT T TACGAAGC AGAATGGAGGGAA 
GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 
ACCAGGGGGAAGTCACCG AGAAGATGAT GTGT GC AGGC AT CCCGGAAGGGGGT GTGGAC ACC TGCC AGGGT 
GACAGT GGTGGGCCCCTGATGT ACC AAT CT GACC AGTGGC AT GTGGTGGGC AT C GT T AGCTGGGGCT AT GG 
CTGC GGGGGCCC GAGC ACCCC AGGAGT AT ACACC AAGGTC TCAGCC TATC TC AACT GGATCT AC AATGTCT 
GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CCCCCAAAGT CAGACACAGAGC AAGAGT CC CCT T GGGT ACACCCCT CT GC CCAC AGCCT C AGCAT 
TTCT TGGAGCAGCAAAGGGCCT CAAT TCCT GT AAGAGACC CT CGCAGCCC AGAGGC GCCC AGAGGAAGTC A 
GC AGCC CT AGCTCGGC CACACT TG GTGCTCCC AG CATCCC AG GGAGAGACACAGCC C ACT GAAC AAGGTC T 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
C AGATC AC TGTGGGCT GGAGAGGAGAAGGAAAGGGT CTGCGC C AGCCC TGTCCGTCT T C ACCCATCCC CAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVGIPI I IALLSLAS I I IWVLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KT PRWGGEE AS VDS WPWQVS I Q YDKQHVCGGS I LDPHWVLTAAHC FRKHT DVFNWKVRAG S DKL 
GSFPSLAVAKI I I IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMS D I LLQASVQV IDS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDSGGPLM YQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
; W AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 

31 ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 

i ii 

CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 

Hi GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 

1X1 

\\ GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
53 CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 

■ HI 

M! TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 

CTCCTTCCCTC TGAGAGGCCCTCC TAT GT CCC TAC TAAAGCCACCAGCAAGACATAGC T GACAGG 
m GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
n| GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 

CT CAAT T TAAAT CATGT TC TAGTAAT T GGAGCT GTCCCCAAGACCAAAGGAGCTAGAGC TT GGT T 
CAAATGAT CT CCAAGGGCCC T TATACCCCAGGAGACT T TGATT TGAAT T T GAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
T T TGGGAGGCCGAGGCGGGTAGATCAC CT GAGG TCAGGAGT TCAAGACCAGCCTGGCCAACAT GG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCT ACT CGGGAGGC TGAGACAGGAGAAT TAC T T GAACC TGGGAGG T GAAGGAGGC TGAGACA 
GGAGAATCACT TCAGCC TGAGCAACACAGCGAGAC T C TGTC TCAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE lid 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE us 

CAGCAGTGGT CT C T CAGT CCTCTCAAAGCAAGGAAAGAGTACT GTGT GC T GAGAGACCATGGCAA 
AGAATCCT CCAGAGAAT T GTGAAGACT GT CACAT TC TAAAT GCAGAAGCT T T TAAAT CCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TC TACAGCAAT GGAGAGAAGAAGAAGATT TACATGGAAAT T GAT CCT GT GACCAGAACTGAAATA 
T TCAGAAGCGGAAAT GGCAC TGAT GAAACAT T GGAAGTGCACGAC TT TAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T G AAC C AGAAG AG GAAAT AG AT G AG AAT G AAGAAAT T AC C AC AAC T T T C T T T G AAC AG T C AG T G 
AT TT GGGT CCCAGCAGAAAAGCCT AT T GAAAACCGAGATT T TC T T AAAAAT T CCAAAAT TC T GGA 
GATT T GT GATAACGT GACCATGTAT T GGAT CAATCCCACTC TAAT AT CAGT T TCTGAGT TACAAG 
ACTT T GAGGAGGAGGGAGAAGATCT T CAC T T TCC TGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGT GGGTGGTCCC T CAAGT GAAAGTAGAGAAGACCCGTCAC GCCAGACAAGCAAGTGAGGA 
AGAACTT CCAATAAAT GAC TATACT GAAAAT GGAATAGAAT TT GATC CCAT GC TGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
T GCT GGCAACATATAATAAAT GCAT GC TAT TCAATGAATT T CT GCCTAT GAGGCATCT GGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT CAC C AAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSEPEEE I DENEE I TTT FFEQS VIWVPAEKPIENRDFLKNSKI LE ICDNVTMYWINPTL I SVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 
amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

GlycosaminoglYcan attachment sites . 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATC ATG TCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TC AC ACTAAT TGGGGGT G TG AT GAT G TGCAT CGCCT GCCGGG GCCT GGC ACC AGAAGAAACC AACT AC AAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTC CAAC ACCAAAAACAAGAAGATAT ACG AT GGAGGT GC C CGCAC AGAG GACGAGGT AC AATCTT AT C 

CTTC CAAGCACGACT AT G TG TAA TGC TCT AAGACCT CTCAGC ACGGGCGGAAGAAACT CCCGGAGAGC TC A 

'"•;t I 

[ fl CCCAAAAAAC AAGGAGATCCCATCT AGATT T CTT CT TGCT TT T GACTC AC AG CT GGAAGT T AG AAAAGCC T 

l ! 

CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
iii GCTGAGTT AT TT ATGAAT TAGAGGCT AT AGCT CACATTTT CAAT CCTCT AT TTCTTTTTT TAAAT AT AAC T 

lj{ TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
Ml TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 

GAGT AGACCC AAAGAT GTTATT TT CT GCTGT T TGAATTTT GT CTCC CC ACCC CC AACT TGGCTAGTAATAA 
AC AC T T ACTGAAGAAGAAGC AAT AAGAGAAAGAT AT TTGT AAT C TCTCCAGC CC ATGATC TCGGTT TT CT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TT GAC ATCTT CT T AT T AC AGCAAC AC CATT CT AGGAGTT T CC TGAGCT CTC C AC TGGAGT CCTCTT TC TGT 
CGCG GG TCAGAAAT T GTCCCT AGAT G AATGAGAAAATT AT TTTTTT T AATTTAAGTCCT AAAT ATAGTT AA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT CACACC TGT AAT CCTAGCACTTTGGG AGGC TGAGGAGG AAGG AT C ACT TGAGCCC AGAAGT 
T C GAGACT AGCCTGGGCAAC ATGGAGAAGC CCTGTCTCT ACAAAAT AC AGAG AG AAAAAATC AGC C AGTC A 
T GGT GGCAT ACACC T GTAGT CCCAGCAT TC CGGGAGGCT GAGGTGGGAGG AT CACTTG AG CCC AGGGAGGT 
T GGGGC TGC AGT GAGC CATGAT CACACC AC TGCACT CCAGCCAGGT GAC AT AGC GAGATC CTGTCT AAAAA 
AATAAAAAAT AAAT AATGGAAC AC AGC AAGTCCT AGGAAGTAGGTT AAAACT AATT CT TT AA 



Mi 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIG 

CAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYT FGAALFVGWVAGGLTLIGGVMMC I A 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCAT CCCAGAGGATA ATG GCAACCCATGCCTTAGAAAT CGCTGGGCT GTTTCTT G 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
'jjf ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
:;p CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 
Ifi TCTACTCCAGAAGTCAGTATGTG TAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
i ; f| ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
*S| AATCTTAATTACAGGAACTGT GCATCAGCTATTTAT GAT TCTATAAGCTAT T TCAGCAGAATGAGATA 

TTAAACC CAATGCTT TGATT GTT CTAGAAAGTATAGTAATT TGT TTTCTAAGGTGGT TCAAGCATCTA 
P| CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
4* ACGACATAGCATTAT GTACATAGATGAGTGTAACAT T TATATCTCACATAGAGACAT GCTTATAT GGT 

py TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 

ATCATGGATAGGGTT GAAGAAGGT TACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
flj TTTATAAT GAAGATTAAAAT GAAGGCT T TAAT CAGCATT GTAAAGGAAATTGAATGGCTT TCT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGT GTATTAAATTAACAT TTT TAAAACGCAGATAT TT T GTCAAGGGGCTT T GCAT TCAAACTGCT T 
T TCCAGGGCTATACTCAGAAGAAAGATAAAAGT GTGATCTAAGAAAAAGTGAT GGTT TTAGGAAAGT G 
AAAATATTTTT GTTT TT GTATTTGAAGAAGAATGAT GCAT T TTGACAAGAAAT CATATAT GTAT GGAT 
ATAT T TTAATAAGTATT TGAGTACAGACT TTGAGGT TTCAT CAAT ATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATAT TAATAAAT TGTACATT TT TCTAAT T 
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FIGURE 120 



mTHALEIAGLFLGGVGWGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
I YDS LLALS PDLQAARGLMCAASVMS FLAFMMAI LGMKC TRCTGDNEKVKAH I LLTAG 1 1 F 1 1 TG 
MWL I PVSWVANAI IRDFYNS I VNVAQKRELGEAL YLGWTTALVL I VGGAL FCCVFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



.< IS:. 



VI 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGC TT TGAGGAGT C CT GGACACCCAAC TAG 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT TGAAGC TATAAT T TAT T T GGACCAAGGAAGCCCT GAAATGAAT TCAACAAT TAATAT TCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACT TGTTCAGAT TACCCAAAAGGAGATGCTTC TACTGGAT GGAATTCAGT TTCTCGCATC 
AT TAT T GAAGAAC TACCAAA ATAAA TGCT T T AAT T TTCAT T T GCTACCT CT T T T T T TAT T ATGCC 
T T GGAATGGT TCACT TAAATG ACAT T T TAAATAAG T T TAT GTATACAT CTGAAT GAAAAGCAAAG 



£3 CT AAAT AT G T T TACAGACCAAAGTGTGAT TT CACACT GTT T TT AAATCTAGCAT TAT T CAT T TTG 

CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
'■*?} CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 

AAAAAATATAAAAGC TACCAATC T T TGTACAAT T T GTAAATGT TAAGAAT x T TT T T TATATC TGT 
TAAATAAAAATTATTTCCAACA 



ru 
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FIGURE 122 

MRPQGPAAS PQRLRGLLLLLLLQLPAPS SASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 



Signal peptide: 

amino acids 1-30 



[ ztt Transmembrane domain: 

m amino acids 195-217 

W 

® 

la 
m 
w 

yS 
fu 

ri 

ry 
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FIGURE 122 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCCT GGAAGAATACAT CATGT T T T TCGATAAGAAGAAAT T GT AGGATC CAG TTTTTTTTT TA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCC T AAGCCAAAGCAAAAGACC T AAGGACGACC T T TGAACAATACAAAGGATGGGT T TCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
C T TGACCATAACCATATCAGCAATAT T GACGAAAATGC T T T TAAT GGAATAC GCAGAC T CAAAGA 
GCTGAT TCT TAGTTCCAATAGAATCTCCTAT TT TCTTAACAATACCT TCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
y-| CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
£ j CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
^ TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
% T C CAACAAGCTCACAT T TATTGGTCAAGAGATT T TGGAT TC TT GGAT AT CCC TCAATGACATCAG 

Q TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
iff T T AAAGGT CTAAGGGAGAATACAAT TATC T GTGCCAGTCCCAAAGAGCT GCAAGGAGTAAATG T G 

M AT CGAT GCAGTGAAGAACTACAGCAT C TGT GGCAAAAGTAC TACAGAGAGGT TTGAT C TGGCCAG 



GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGT C CC TAAAGCAAATGACT CCCAGCACCCAGGAAT T T TATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGC TGC TGAAT GGGACGGGACCCTGCACC TA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
G G T CAT TTTCCTCT C ATAC AT AAT C AAC C CAT T GAAAT T T AAAT AC C AC AAT C AAT G T GAAG C T T 
GAAC T CC GGT T TAATATAATACCTAT T GTATAAGACCC TT TAC T GAT T CCATTAATGTCGCAT T T 
GT T T T AAGATAAAACT TC TT TCAT AGGTAAAAAAAAAAA 
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FIGURE 124 



ri 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGK3y[VYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMI RLKELHLEHNQ FSKLNLAL FPRLVSLQNLYLQWNK I S VI GQTMS WT WS S LQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS ICGKS TTERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKI IAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 



a 

fi Important features of the protein: 

J Signal peptide: 

11 amino acids 1-33 



Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 



Tyrosine kinase phosphorylation site. 

amino acids 136-142 



N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC T T TC TAAAGCT T TGGCAAGGAG TGACAC CCGCCATT TACAGACACG T AGT 
GTATTCTGGAGGTCGTVATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAAT CCAAC TGACC TAGT GAAGGT TCAGATGCAAATGGAAGGAAAAAGGAAAC TGGAAGG 
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
GAGGGCT TTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGAT TTA 
ACC AC T TAT GAT AC AG TGAAACAC T AC T T G G TAT T GAATACAC CAC T T GAG GAC AATAT CAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGAC TT TT GTATAAATCATCG 
ACTGACTGCTTGAT T CAGGC TGTTCAAGGT GAAGGAT TCATGAGT C TATATAAAGGC T T TT TACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRK^ 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKS S TDCL I QAVQGEGFMSLYKGFLP S WLRMT PWSMVFWLT YEK IREMS GVS PF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCC TC TCCATGT TC T CAGC TATCCGT TCT CAGCACAGCGG TGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
|3 AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

:p 

5 GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
fl AGACACAAT TCGC C T T CAAACT CCACGACAAGGACCC TGAGAAGGCAGCT GCCACC TATGAGCAA 

|1 ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
4 CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
^ TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 

2 CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 

M 

g TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
Sj TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

3 TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 



ry 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKW 
PYRCDLSNEEDILSMFSAIRSQHSGVDICimAGI^^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGT TGAGTGCT TGAAATGAGGAACTGAGAAAATTAATTTC TCATGTATTT TTCTCAT TTAT TTA 
T T AAT T TT TAAC TGATAGT TGT ACAT ATT T GGGGGTACATGTGAT AT TT GGATACATGTATACAA 
TATATAAT GATCAAAT CAGGGT AACT GGGAT ATCCATCACATCAAACAT TTAT T T T TTAT TC TT T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TTCCATCCATGT TGCTGCAAATGACAGGAT TTCGTTCT TAATTTCAAT TAAAATAACCACACATG 
GCAAAAA 



APP ID-1 0063557 



Page 279 of 31 8 



FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 
amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 



TT CT GAAGT AAC GGAAGC T ACCTT GT AT AAAGAC CT C AAC AC T GCT GACC ATGATC AGCGCAGCC T GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGGATGCT ACAACTCTCT ACCTTC AGAAC AACC AAAT AAAT AAT G CT GGGATT CCTT CAGAT 
TT GAAAAACT TG CT GAAAGT AG AAAG AAT ATACCTAT ACC AC AACAGT TT AGAT GAATT T CCT ACCAACCT 
C CCAAAGT AT GT AAAAGAGTT ACATT TGC AAGAAAAT AAC AT AAGGACT ATC AC TT ATGATT CACT TTCAA 
AAATTC CCT ATC TGGAAG AAT T ACATTT AG AT GAC AACT C TGTC TCTGC AGTT AGCAT AGAAGAGG GAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 
£jj GTCT CACT AGTC T AAAACGCCT GGTTCT AG AT GG AAACCT GT TGAACAAT C ATGGT TT AGGT GAC AAAGT T 

TTCT TCAACC T AGTT AATTTGAC AGAGCTGTCCCT GGT GCGGAAT TCCCT GACT GC TGCACCAGT AAACCT 
I : I TCCAGGCACAAACCT GAGGAAGCT TT AT CT T CAAGAT AAC C AC ATC AATCGGGT GC CCCC AAATGCTT TTT 

Ul CTTATCTAAGGC AGCT CT AT CGAC T GGAT ATGTCC AAT AAT AAC CT AAGT AATTT AC CT C AG GGT ATC TT T 

U1 GATG AT TTGGAC AAT ATAACACAACT GATT CTTC GC AACAAT CCCTGG T ATT GC GGGTGC AAGATG AAATG 

H GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
^ AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 

;•!!; ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 

i 11 

%** CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 

'a i*l 

* I 'i CAATTACAATTACTGTGAAGTCTGT C ACCT CTGAT ACC AT TCAT ATCT CT TGGAAACTTGCT CT ACCT ATG 

m ACT GCT T TGAGACTCAGC T GGCTT AAACTGGGCC AT AGCC CGGC AT TT GG ATCT AT AACAGAAACAATTGT 

| s *| J AAC AGGGGAACGCAGT GAGT ACTTGGTCAC AGCC CT GG AGCCTGAT TC ACCC TATAAAGT ATGC AT GGTTC 

CCAT GGAAAC CAGCAACCT CTACCT ATT TGATGAAACT CC TGTT TGT AT T GAGACT GAAACT GC ACCCCTT 
CG AATGTAC AACCCT ACAACCACCCTC AAT CG AG AGC AAG AGAAAGAACC T T AC AAAAAC CCCAAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T TCAT AGG AATGGATC GCT CTT CTCAAGGAACT GTGCAT ATAGCAAAGGGAGGAGAAG AAAGGATG AC TAT 
GCAGAAGCT GGCACT AAG AAGGAC AACT CT ATCC TGGAAATC AGGGAAACT T CTTT TC AGAT GT T ACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGT AG TAACCGAAGCT AC AGAGAC AGT GGT ATT CC AG AC TC AGAT CAC 
T CACACTCATGATGCT GAAGGACT CAC AGC AG AC T TGT GT TTTG GGTT TTTT AAACCT AAGGGAGGT GAT G 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVS I EEGAFRDSNYLRLL FLSRNHLS T I PWGLPRT I EELRLDDNRI S T I S S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRC31AIKDLNAELFDCKDSGIVST IQI TTAI PNTVYPAQGQWPAPVTKQPD 
I KNPKLTKDQQTTGS P SRKT IT I TVKSVT SDT IH I S WKLALPMT ALRLSWLKLGHS PAFGS ITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSMLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPIAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites . 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 132 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTC TGGGC T TGGCCAGAGC TCAGGGTGC TGAGCG T GT GACCAGCAGT GAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
y s TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
S3 CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 

G!l GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 

iTl 

V" CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 

Uj 

, * GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 

U S 

Iff AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
"•J GACCACT CCAACAGAGCCAGGACT AT AT CAACCTCT TC TGCGCCAACATGAT GGAC TTGAA.CCGC 

* AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
; !; ;| TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TC GGGGAGCCT GATGC TGAAGATGAAGAAT TAT CTAAA.GC TATT CAA. 
*H TATCAGCAGCAT TT T TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT T T CCAGAT TC TCGCT C TGT 

TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
Ptl AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 

T TTTATAT T T T T T TAG TAGAGACAGGGT T T CAT CATGT TGCTCATGC T GGT CTC GAAC TCC TGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC TTTCAAAGAGACT GT GT T GAATAAAGGGCCAAGGT T CT T GCCACCCAGCACT C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
T AE SKAT I ADL I LSALERATVFLE QRLPE I NLDGMVGVRVLEE QLKS VREKWAQE PLLQPL S LRV 
GMLGEKLEAAI QRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
^ Signal peptide: 

?]l amino acids 1-26 

Ill 

HI Transmembrane domain: 

M amino acids 39-56 

j'PJ Tyrosine kinase phosphorylation sites. 

m amino acids 149-156, 274-282 

n i 

: Ht 

N-myristoylation sites. 

■■v ammo acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE ias 

GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGAT TTGATCCCGTTCGAGTGGATATCACT TCGAAAGGAAAAAT GAGAGCAAGATATG 
T GAAT TACATCAAAACATCAGAGGTT GTCAGACTGCCCTATCCTCTCCAAATGAAATCT TCAGGT 
C CAC C T T C T T AC T T TAT TAAAAG G GAAT C G T GG GGC T GGAC AGAC T T T C T AAT GAACC C AAT GG T 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACAT GAGACGGGAAAT GGAGCAGT CAAT GAATAT GC TGAATT CCAACCATGAGTT GCC TGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTAT7\AACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAAT AT TGATGTCACTGAAT TAATTACAGTGTCCTATAGAAAATGCCATTAATAAATTAT 
ATGAAC T AC TATACAT TATGTATAT TAAT TAAAACAT C T TAAT CCAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 



APP ID=10063557 



Page 285 



FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYVVEVVSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMN^^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

GlYcosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
C T TACCT GC TGGGCAC TAACGGC GGAGCCAGGATGGGGACAGAATAAAGGAGCCAC GACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGT GAT C GCCACAAACC T TCAGGAAATAC GAAATGGAT T TT C T GAGATAC GGGGCAGTG 
TGCAAGCCAAAGATGGAAACAT T GACAT CAGAAT C T TAAGGAGGACT GAGT CT T T GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTC TGAGT CAC T T T GAAAAGC TGGAACC TCAGGCAGCAGT TGT GAAGGC 
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTT TC TGCTAT TGGATATATTTATTAGTTAATATAT TTAT TTAT TT TTTGCTATTTA 
ATGTATTTATTTTTT TACT TGGACATGAAACTT TAAAAAAATTCACAGATTATAT TTATAACCTG 
ACTAGAGCAGGT GATGTAT TTT TATACAGTAAAAAAAAAAAACC T TGTAAAT TC TAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
AT TT GAAAT TGAACCAATGACTACT TAGGAT GGG T TGTGGAATAAGT TT TGATGTGGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAAT CC TACACGGCCAGCATGTAT T T CTACAAATAAAGT T T TC T T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

S S REAWQ FFLLLWS PDFRPKMKAS S LAFS LLS AAFYLLWT P S TGLKT LNLGS C VI ATNLQE I RNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SLANSFLTIKKDLRLSHAHMTCHCGEEA 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATCCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC T GGAAAAAGAATAGT CT T T TATGGAGAT GAAACC TGGGT TAAAT TA 
T T CCCAAAGCAT T T T GTGGAATAT GAT GGAACAACC TCAT T T TT C GT GT CAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCAT TTGGATAAAGTATTAAAAAGAGGAGAT TGGGACATAT TAATCC 
.?* TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
i; s 5 AAGC T GAG C GAGATGGACAGCG T GCT GAT GAAGAT C CAC AC C T CAC T G C AGT CGAAGGAGAGAGA 

GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
J5 GGGCC T C CT CCACCGAGGAGGT GAATACACCTC TGAT T TTAAT CAGT T C T GCGT TTGAAAGGAAA 

*H CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAG ACGGATGTGGCTGCGACACTGGCGATAGC 
Uj AC T T GGCT TAC CGAT TC CAAAAGACAGTGTAGGGAGCC TCCTAT T CCCAGT T GTGGAAGGAAGAC 

ijl CAATGAGAGAGCAGT TGAGATTT TTACATTTGAATACAGTGCAGCTTAGTAAAC TGTTGCAAGAG 

:'«§ AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
:f J GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
!5 1 TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
3 TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
!.-;| CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
1% CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
•i*? GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
W AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
flj TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
Q CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 

TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT CAGAT CCACAGAGCCCAGGATCAAGGGACC CAC T GCAGTGGCAGCAGGAC TGT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GAC CGAGACAC TCACAGC TT TGT CATCAGGGCACAGGC T TCC T CGGAGCCAGGATGATC TGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CT GCACACAGTAT GTAGT TACCAAAAGAATAAAC GGCAATAAT TGAGAAAAAAAA 
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FIGURE 140 

MRLGSGT FATCCVAIEVLGI AVFLRG FFPAPVRS SARAEHGAEPPAPE PSAGAS SNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGWFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVI RNLNS PALLEDS V I RQAKAAGKR I VFYGDET WVKL FPKH FVE YDGT T S FFVS DYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

m 

Transmembrane domain: 

hM 

[fl amino acids 58-7 6 

fi 

N-glycosylation sites. 

^ amino acids 56-60, 194-198 

• js 

m 

^ N-myristoylation sites. 

nj amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 

23 275-281, 278-284 

ry 

Amidation site, 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE iai 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
TCCC TAT AGAAAACAACTGCCAGCAC CTT AAGACCACT CACACC T TCAGAGT GAAGAAC TTAAAC 
C CGAAGAAAT TCAGCAT TCATGACCAGGAT CACAAAGTAC T GGT C CT GGACT CTGGGAATC TCAT 
AGCAGT T CCAGATAAAAAC TACATACGCC CAGAGATC T TC T T T GCAT TAGC CTCATCC T TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T G TGACAAG GAT AAAGGACAAAG TCAT CCAT CCC T TCAGC T GAAGAAGGAGAAAC T GAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAAT TTGAGAACAGGAAACACAT TGAATTT TCAT TT CAACCAGTTTGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAG GAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T GAAC TAAT T GTATAAAAACACCAAACC TGC TCAC T 
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FIGURE Id2 

MLLLLLSYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT T TGC TGT CCACACAT TCAACCAACAGAGCAAGGAC TACT AT GCC TACAGACTGGGG 
CACAT C T T GAAT TCC T GGAAGGAGCAGGT GGAGTCCAAGACTGTAT T CT CAATGGAGCTAC T GCT 
GGGGAGAACTAGG TGT GGGAAAT T TGAAGAC GACAT TGACAACT GCCAT T TCCAAGAAAGCACAG 
AGCTGAACAATACTTTCAGCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCT GAACAAGACC T GCT T GGAGGGAT T CCAC TGAGTGAAACCCACT CACAGGC T TGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT T GAGCC TAATCATGTAGT GTAGATCAT TAAACATCAGCAT TT TAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE ias 



•hi 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCAT CAAGCAGAGAAT CAAGT GGAACCGGAAGGCCCT GCCCAGCAC T GCC CAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
: s CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
J TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 

3 AAGCTT GCCAGGAGGC TGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 

% 

■ CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 

I 

r. TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

I 

| CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCAC TGCGCCACC T GC T T CACAGTACTT CCCAACAACTC T TAGAGGTAG 
GTGTAT TCCCGT TTTACAGATAAGGAAACT GAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAAGCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CT CTAATGAAAT TGTGAAAGCT CCAT GT T T AGAAATAAAT GAAAACACCT GA 
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FIGURE 1*6 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDI DFGAEGNRYYEANYWQFPDG IHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWIMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 7 6-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T TC AGAT T C AG GGG G T ACAT G T G AAG GTTTGTTT TAT GAG TAT AT T G C ATGA T GC T GAGG 
TTTGGGGT 
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FIGURE id8 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



y y N-myristoylation site. 

r3 amino acids 62-68 

a 

Oi- 
ly 
Ml 

m 



Q 

in 

Q 

m 

Q 

m 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGA ATG GGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGAC AAGAT TAAAAAAACAT ATC C T C C AGAAAACAAGCC AGGT C AGAGC AAC TAT T CTT T 
T GTT GATAACT TGAACCTGC TAAAGGCAATAACAGAAAAGGAAAAAATT GAGAAAGAAAGACAAT CTA 
T AAG AAG C T C C C C AC T T GAT AAT AAG T T G AAT G T G G AAGAT GT T GAT T C AAC C AAG AAT C G AAAAC T G 
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAA.TCGCTGCCAGGATTTATGAAG 
A7VAA.TGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTT TACAAAAATTAATCT CAAAGGAAGCCAACAA 
TTAT GAGGAGGAT CCCAATAAGCCCACAAGCTGGACT GAGAATCAGGCTGGAAAAAT ACCAGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
T TAACCTTGACAAAT GGCTT GGAAAGGAGAACTAAAACCTACAGTGAAGACAACTT T GAGGAACT CCA 
AT AT T TC C CAAAT T T C TAT GC GC TACT GAAAAG TAT T GAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA. 
CACT GATTACTATCATGAAAACACT GATTGACT TTGTGAAGATGATGGTGAAATAT GGAACAATATCT 
CCAGAAGAAGGTGTT TCCTACCTT GAAAACT TGGAT GAAAT GAT TGCTCTT CAGACCAAAAACAAGC T 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
C AG AC AG T AC C AAGG AAGAAG CAG C T AAG AT G GAAAAGG AAT AT G GAAG CTT G AAG GAT T C CACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAATAT TGAAT GGT T GAAGAAAC AT GACAAAAAG GGAAATAAAGAAGATTAT GAC C TT T 
CAAAGAT GAGAGACT TCATCAATAAACAAGCTGATGCTTAT GT GGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCAT T TATAGCAGCCTG TAAA AAT GGCAAAAGAT CCAGGAGTCT TT CAA 
CTGTT TCAGAAAACATAATATAGCTTAAAACACTTC TAATT CT GTGATTAAAATTT T TT GACCCAAGG 
GT T AT TAGAAAGT GCT GAAT T T ACAGT AGT T AACCT T T T AC AAG TGGT T AAAACATAGC TTTCTTCCC 
GTAAAAACTAT CT GAAAGTAAAGT TGT ATGTAAGCT GAAAAAA?y^AAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNSQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS I RS S PLDNKLNVEDVD S TKNRKL I DD YDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETLI T IMKTL IDFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKT DE PKGKTEAYLEAI RKNI EWLPCKHDKKGNKEDYDLSKMRDFI NKQADAYVEKG I LDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites : 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAGG GCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
M CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
Q CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
pi TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGC T CTCCACAT GAAGT CC TGT C AC TCACCAC T GT GCAGGAGAGGGAGGT GGT CATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
W TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
i !: ff GAAAT GGC T CGAGC TCAGAAGATAAAAGATAAGTAGGGTAT GC T GAT CCTC T TTTAAAAACCCAA 

j; ff GAT ACAAT CAAAATCC CAGATGC TGGTCTC TAT TCCCATGAAAAAGT GCTCATGACATAT TGAGA 

^ I AGACC TAC T TACAAAGTGGCATATATT GCAATT TAT T T TAATT AAAAGATAC CTAT TTATATAT T 

TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
3 AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TCCTTGTATAAAAATAAGAAAAGAAAT TAATCTTGAGGTAAGCAGAGCAGACATCATCTC TGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
TTGTAGTAG TGATCAGGAAACAGAT CT CAGCAAAGC CACT GAGGAGGAGGCT GTGC TGAGT T TGT 
;!;^ GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
lV GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAG7\AGGCTGCTGTACTGGTTGA 

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
f'jj GT CT CT GCAGAT GTAGT TAGT TAAGACAAGGTCATGC TGGATGAAGGTAGACCTAAAT TCAATAT 

GACTGGTT TCCT TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGTA 
AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC 
AT CAGAAGCT TGGAAGAGGCAAAGAAGAAT TCT TCCC TAGAGGCT TTAGAGGGATAACGGC T C T G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT TGGT TACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAAT AT GGCAGAAGTGATGG.CAT GCCACT TCCAAGAT TAGGT T ATAAAAGACAC TGCAGCT T C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC TAGC T GCCATGC TATGAGCAGGCCTAT AAAGAGAC T TACGT GG TAAAAAATGAAGTC TCCT 
G C CCACAG C C AC AT TAGT GAAC C T AGAAGCAGAGAC T C T G T GAGATAAT C GAT GTTTGTTGTTTT 
AAGT T GCT CAG T T T TGG TC TAAC TT GT TAT GCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 1*2 

MVLSGALC FRMKDSALKVL YLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 1*3 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTGGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCG TC TCATT GGGGAGAAACT GT TCCACGGAGT CAG T AT GAGT GAGCGC T GCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T GAAGGTGAT GACC T GCATAT CCAGAGGAATGTGCAAAAGCT GAAGGACACAG TGAAAAAGC 
TT GGAGAGAGTGGAGAGATCAAAGCAATT GGAGAACTGGATTTGCTGTTTATGTC TCTGAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAAT TAGAT GCCCCAAAGCGAT T TT T T TTAACCAAAAGGAAGATGGGAAGCCAAAC TCCAT CAT G 
AT GGG T GGATT CCAAATGAACCCCT GCGT TAGT TACAAAGGAAACCAATGCCAC T T T TGT T TATA 
fl AGACCAGAAGGTAGACT T T CTAAGCAT AGATAT T TAT TGATAACATT TCAT TGTAAC TGGT GT TC 

; *f TATACACAGAAAACAAT T TAT T T T T TAAATAAT TGTC T T T T TCCATAAAAAAGAT TACT TT C CAT 

TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTAT T TAT TAT TAT T AT AAGAC T GCAT T T TAT T TAT ATCAT T TT AT TAAT AT GGAT T TAT T TAT 
«i AGAAACATCAT TCGATATTGCTACTTGAGTGTAAGGCTAATATTGATAT TTATGACAATAATTAT 

|j AGAGCTATAACATGT TTAT TTGACCT CAATAAACACT TGGATATCCC 



•-"5 
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FIGURE 154 

MAALQKSVS S FLMGTLATS CLLLLALLVQGGAAAPI S SHCRLDKSNFQQPY I TNRT FMLAKEASL 
ADNNTDTOLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 



m 

: '4? 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
T GCC T GAGGGCT GTGC TGAGAGGGAGAGGAGCAGAGATGC TGCT GAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC T TTCTAGATAT TTCCCCCTTGCT GGAGAAGAAAGAGCCCCT GG T TT TAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACATGGATGAT T C TGAGGAGGAAGCT GT TAT TGAATGTAT AGAGAT TTATCCAAATAAATAT 
C T T TAT T TAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites . 

amino acids 44-50, 150-156 
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FIGURE 157 



01 
U1 



CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGT TACAACTAGTGTT GCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GT AGAAGT GAACTTCACAACCAC T CCCCT GGGAAACAGATACAT GGC T CTTATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
T CACACAAT TT GT TACT TCACTGAAT T TC T T CAAAACCAT TGCAGAAGT GAGGTCATCC TT GAAA 



W AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 

fli 

GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GT GAT CT AAGAAGCCAGAT TCAT C T GCACAAATACGT GGT GGTC TAC T T TAGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NAl^EDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHC RS E V I L E KWQ KKKI AEMG P VQWL AT Q KKAADKW FL L S ND VN S VCDGTCGKS EGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPECYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

8 

\ Important features of the protein: 

Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGA7VAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGTVAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACTCAGCT GAAGAAG 



v4 
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FIGURE 160 

MTWTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



us 



ACAC T GGCCAZ\ACAAAAACGAAAGCAC TCCG TGC TGGAAGTAGGAGGAGAG TCAGGAC T CCCAGG 
ACAGAGAGTGCACAAAC TACCCAGCACAGCC CCC TCC GCCCC CT C TGGAGGC T GAAGAGGGAT TC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGATGAGGAAAAGT T T GGAGGAGCAGC TGAC TCAGGGGTGGAGGAGCC TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
MJ CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
U ? l GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
j f| TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
J- 1 AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
' H CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
a G T GGC T GTACT T CACTACCCAGCAAAGCCT C CAC GAGGGCAGCTC GCCT TGGAGAGTACT TAC TA 

Q CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
\fl CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 

CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
; IJ CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 

CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
M) CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 

GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C G GGAC T TAAAT AAAG GCAGAC GC T G T T T T T C TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVI,NVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVl^ACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
M* GAVALC S E WL Q DGV S G PGAH G P H D A FRASL SC VL PD FLQGRAPGS Y VGAC FD RL L H PDAV PAL FRT VP VFT 

LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

q 

. Signal sequence: 

: {•'=• amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 



Ml 



1;, - J 



N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACAT CCT GACGTGGGACAGC GGGCCAGAGGGCACCCCAGACACGG T C TACAGCATC GAG TATA 
AGACG T ACGGAGAGAGGGAC TGGGT GGCAAAGAAGGGCTGT CAGCGGAT CAC CC GGAAG TC C T GC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GT CAGT GCGGGAGGCCGGTCAGCCACCAAGAT GACT GACAGGTT CAGC TCTC TGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TTCTACCACTTAGAGCTCCAGGTCAACCGCACC TACCAAATGCACCT T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
y ; TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
*1 CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
j}2 TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
y AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
Cfi TCCACAGCGGCATAGCC TGT CCGAGAT CACCTAC T TAGGGCAGCCAGACAT CTCCATCC TC CAGC 

Lit CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
fji GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
yj CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
\jj CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
* TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 
CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
C3. TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
fy GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
fi AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
Jj| ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
1 * AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAAT GCAGGGAAAC TCCCGAGGTCCAGAGCCCCACC T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAAT CTAGCTCGACAGAGCAT GAGGCCC CT GCC TCT TCT GTCAT TGT T CAAAGGT GGGAAGAGA 
GCC T GGAAAAGAACCAGGCC TGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACC TGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCT TT TCTGCAGGCAGGAGTTTCAGACCCT 
ATCCT GAGAATGGGGT TTGAAAGGAAGGT GAGGGCTGTGGCCCC TGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCT TGGGT TCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 16 J. 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMI VHPTP TP I RAGDGHRLTLE D I FHDL FYHLELQVNRT YQMHLGGKQRE YE FFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSIiAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQAT PDSWPPS YGVCMEGS GKDS PTGTLSS PKHLRPKGQLQKE PPAGS CMLGGL S LQEVT S LAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids' 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGflAAAAAAAAAAAAAAAAAAAftAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGT T TTGGATATTT TCCAAAAGATT TGATCAAGGTACT TCATAAATACACGGA 
AGAAGAGC T ACATAT TCCAGCAGATGAGACAGAC TTTGTCTGCTTT GAAGGAGGAAGAGATGAT T 
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG 
Q T CGAAGAAAGC TGAAGAAGT TT CT CAGCACAGAGAGAAAT C T CC T GAGGAG T C T CGGGGGCG TGA 

?3 ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
**| CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 

C C T GC GGCT AACGCT CAGGGAG T GCAG TC T TCGT TGGACAC T T T T GAAGAAAT TCTGCACGATAA 
\H ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
s 4 AGACAGAT GCT TACAAAG T CCT GAAAACAGAAAT GAGT CAGAGAGGAAGT GGACAGTGCGTTATT 

* CAT TACAGCAAAGGAT TTCGTT GGCATCAAAATC TAAGTT TGT T T TACAAAGAT T GT T T T TAGTA 

: 5i 2 CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 

Ml 

^ AAAAAAAAAAAAAAAAAAAA 

: .'r 

ry 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT TGAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGT TGCAAAACCTGACAAAACCCGGG 
ACAGGGAT T T CT GATGG TGATT T CTGGATAGGGCT TT GGAGGAAT GGAGATGGGCAAACAT C T GG 
H TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
Q ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
m CTTGGGGGT CCC TACC TTTACCAGTGGAAT GATGACAGGTGTAACATGAAGCACAATTATATT TG 

S| CAAGTATGAACCAGAGAT TAAT CCAACAGCCCC T GTAGAAAAGCC TTAT C T T ACAAATCAACCAG 

J 1 ;! GAGACACCCATCAGAATGTGGT TGTTACTGAAGCAGGTATAAT TCCCAATCTAATT TATGTTGT T 

] S ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
IJl GCATAAAAGTAAAGGAAGAACAAAAAC TAGTCCAAACCAGTCTACAC T GTGGAT T TCAAAGAG TA 

iff CCAGAAAAGAAAGTGGCATGGAAG TATAATAACTCAT T GAC T TGGTT CCAGAAT T T TGTAAT T C T 

Z I GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
^ AAGAT GAAC TGTAAGG TCCCCC T T GAGGCAAATAT TAAAGTAAT T TT TATATGT C TAT TAT T TCA 

* TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
23 AC TT CAAAC T TCAAGCAAATGAAAT GGACAATGCAGATAAAGTT GT TAT CAACACGTCGGGAGT A 

y| TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
^ T G TAT AT T G TAT T GAAAT T T AC AG T GT G C AAAAG TAT T T T AC C T T T G C AT AAG T G T T T G AT AAAA 

AT GAACTGT TC TAATAT TTAT T T T TAT GGCATC TCAT T TT T CAATACAT GC T C T T T TGAT TAAAG 
nj AAAC T TAT TAC TG T TGTCAACTGAAT T CACACACACACAAATATAG TACCATAGAAAAAGTT TGT 

Q TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
r;j AAT AAGAAGCTATT TCAT TAAGT GTGATAT AAACC T CCTCAAACAT TT TACT TAGAGGCAAGGAT 

T GTCTAAT T TCAAT TG TGCAAGACAT GT GC C T TATAAT TAT TT T TAGC TTAAAAT TAAACAGAT T 
T TGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATT TGAACAAAAGAAG 
T GACATACACAAT ATAAATCATATGT C TTCACACGT TGCCTATATAATGAGAAGCAGC TC TC T GA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
AT T GACACT GGAGGCAGATAGT TGCAAAGT TAGTCTAAGGT T T C CCT AGCT G TAT T TAGCCTC TG 
AC TAT AT TAGTATACAAAGAGGT CAT GTGG T TGAGACCAGGTGAATAGT CAC TAT CAGTGTGGAG 
ACAAGCACAGCACACAGACATT T TAGGAAGGAAAGGAAC TACGAAAT CG TG TGAAAATGGGT T GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
P YLTNQPGDTHQNWVTEAG 1 1 PNL I YWI PT I PLLLL I LVAFGTCC FQMLHKS KGRTKTS PNQ 
STLWISKSTRKESGMEV 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 



Transmembrane domain: 

amino acids 214-235 



\B N-glycosylation sites. 

ill amino acids 86-89, 255-258 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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